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Abstract 

We consider shared-object systems that require their threads to fulfill the system jobs by 
first acquiring sequentially the objects needed for the jobs and then holding on to them until 
the job completion. Such systems are in the core of a variety of shared-resource allocation 
and synchronization systems. This work opens a new perspective to study the expected job 
delay and throughput analytically, given the possible set of jobs that may join the system 
dynamically. We identify the system dependencies that cause contention among the threads 
as they try to acquire the job objects. We use these observations to define the shared-object 
system equilibria. We note that the system is in equilibrium whenever the rate in which jobs 
arrive at the system matches the job completion rate. These equilibria consider not only the 
job delay but also the job throughput, as well as the time in which each thread blocks other 
threads in order to complete its job. We then further study in detail the thread work cycles 
and, by using a graph representation of the problem, we are able to propose procedures for 
finding and estimating equilibria, i.e., discovering the job delay and throughput, as well as the 
blocking time. To the best of our knowledge, this is a new perspective, that can provide better 
analytical tools for the problem, in order to estimate performance measures similar to ones 
that can be acquired through experimentation on working systems and simulations, e.g., as job 
delay and throughput in (distributed) shared-object systems. 


1 Introduction 

We consider shared-object systems that require their threads to fulfill the system jobs by first 
acquiring sequentially all of the job objects. The job then holds on to these objects until the job 
operation is done. We identify the system dependencies that cause contention among the threads 
as they try to acquire the job objects. We study the (stochastic) processes of job arrival and 
completion with an emphasis on the cases in which the job arrival rate matches the job completion 
rate, i.e., the job throughput. In these cases, the system is in a shared-Object System Equilibrium 
(OSE). For a given e > 0 and an OSE, we say that the system is in an e-OSE when the completion 
rate of any job differs from the one of an OSE by at most e. We study the conditions for a given 
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shared-object system to be in an OSE as well as contention-related properties of OSEs, i.e., the 
expected ]ob delay and completion rate, as well as the time in which each thread blocks other threads 
and by that prevents them from making progress. We propose an analytical procedure for finding 
(in polynomial time) e-OSEs. Moreover, we estimate the performance measures of systems that are 
in e-OSE. 

The existing practice considers job delay and completion rate as the performance measures of 
working systems. Empirical experiments often study shared-resource systems at their saturation 
point in which the system is at its peak utilization. Let us describe peak utilization scenarios using 
two vectors; one for job arrival rates and another for their completion rates. A saturation point is 
the case in which: (1) the system is in equilibrium, i.e., the arrival rate of any particular job matches 
the completion rate of this job, as well as (2) the system is at the stage at which a higher arrival 
rate of any job to the system cannot increase its completion rate. Our study considers the entire 
range of these equilibria rather than just peak utilization scenarios (Section 2). We then propose a 
procedure for finding e-OSEs, if such exist in the given system (Section 3). Once we find an e-OSE, 
we can estimate its performance measures, i.e., job delay, completion rate and blocking time. To this 
end, we develop a number of analytical tools for OSEs. Given the job arrival rates, we show how to 
estimate the probabilities for threads to follow a certain object acquisition sequence (Section 5). We 
are then able to formulate recursive equations (with interdependencies) for calculating the blocking 
periods and the completion rates (sections 6, and respectively, 7). We overcome these dependencies 
and solve these recursive equations by analysing the thread work cycles (Section 8). 

Related Work. Our problem domain considers computing entities, which are called threads. 
Each thread runs a sequential program that has to acquire reusable resources (objects), often several 
at the same time, for a bounded time of use. To guarantee deadlock absence, it is important that 
all threads acquire the objects in an ordered manner. For example, one can deterministically define 
a partial order among the objects, such that the threads acquire them in totally ordered manner. 
We consider a generalization of the dining philosophers problem, as in [15, 17], in which every job 
includes a fixed set of objects that it may need. This problem has well-known results studying the 
worst-case job delays, which may even be exponential on metrics, such as the chromatic number of 
the resource graph [14, 15]. In this graph, the vertices (objects) are connected if there is at least 
one thread that may request them both at any point in time. In the context of actual systems, the 
expected time is rather different than the worst case and therefore computer experiments are the 
common way for evaluating the system performance. We provide a new perspective that enables an 
analysis of the evaluation metrics by considering measures both at the system level and at the level 
of each resource. In particular, we consider performance measures that are associated with each 
resource, such as the delay, completion rate and blocking time. On the system level, we consider 
the job arrival and completion rates, as well as the total number of threads, N, and objects, M. 
Our contribution. We study analytical tools that provide the means to estimate performance 
measures of working distributed systems. In the context of synchronization challenges that are 
modeled via a generalization of the dynamic dining philosophers problem, our analytical tools are 
the first, to the best of our knowledge, to consider performance measures similar to the ones that 
can be acquired via experimentation on working systems and simulations. 

For a given number of threads and job arrival rates, we provide a way to analyze the delay of jobs 
and their completion rates as well as the time for which the threads are blocked. In addition to the 
job completion period (Lemma 3), we analyze a number of key properties, such as the probability to 
request a particular resource after the acquisition of another specific resource, the time during which 
threads that have acquired such a particular resource block other threads that ask to access the same 
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resource (Lemma 4) as well as the time between two requests to access such resources (Lemma 5). 
Since these properties have interdependencies due to thread blocking, we show how the concept 
of thread work cycles can be represented in subsystems that also include such interdependencies 
but have no thread blocking. This way, we can resolve these interdependencies (Theorem 6) and 
estimate the performance of the given (distributed) shared resource system. Moreover, we use the 
work cycle events (sections 2.3 and 2.5) to verify our modeling approach. We present a procedure 
for satisfying approximately the equilibrium conditions and by that find an e-OSE as well as the 
performance measures of the studied system (Section 3). 

Our contribution can facilitate early-stage evaluations of systems that are similar to the studied 
one. Moreover, using our proposed methods, one can analytically, rather than via empirical exper¬ 
iments, study trade-offs among OSEs. Such trade-offs can facilitate the design of mechanisms for 
adjusting the number of threads and job arrival rates according to the performance measures of a 
dynamic system. 


2 Preliminaries 

We consider a system that includes (system) items, which are (totally ordered) objects, (object[ 1], 
..., object[M]), and (totally ordered) threads, {thread[ 1], ..., t.hread[N]). The objects are shared 
in a mutually exclusive way, i.e., only one thread at a time may gain access to an object. Each 
thread is to carry out one job at a time, where jobi = (objsi, operatiorii), J is the number of the 
system’s jobs, i £ [1, J] and objsi = ( object ,: i; ..., object 4 ) is an arbitrary, non-empty subsequence 
of (object[l ],..., object [M]), and thus objsi follows the same order. Note that we assume that 
objsi is a fixed vector and that different jobs may have different object vectors of different lengths. 
Moreover, the (job) operation time, Oi, is a random variable with a known distribution. Namely, 
we assume that the time it takes to execute the job operation is provided, say, via a profiler. 

2.1 Acquisition paths, periods and requests 

Suppose that the system assigns jobi to thread[n} : 1 < n < N. In this case, jobi s (acquisition) path 
is the vector, pathi^n = ( thread[n ], object\i\], ..., object\ik]), in which thread[n\ carries out jobi s 
operation after it has sequentially acquired object^, ..., objecti k . A thread can acquire a particular 
object, object[i\, by pending its (acquisition) request in a (first in, first out) queue Q(object[i\) until 
all (previously) waiting threads in Q(object[i]) have acquired and released object[i\. The acquisition 
period, A, is a known random variable that refers to a period that starts when a thread has acquired 
an object (or just been assigned to a new job) and ends as soon as that thread places a request for 
the next object. Namely, we assume that the time it takes to send a request after a supply event is 
provided, say, via a profiler. Once the thread sequentially acquires the entire object set, object^, 
..., objecti k , it executes the job operation, operation^, before completing the job. We say that a 
thread is blocking when other threads are queuing for its acquired objects. That happens whenever 
different jobs have overlapping object vectors. Note however that threads carry out jobs within 
finite time even in the presence of blocking, because our definition of acquisition paths considers 
object acquisition according to a common (total) order. This work focuses on systems that can 
be in an equilibrium and while in equilibrium it holds that the number of pending requests in 
Q(object[i]) : i £ [1 ,M\ is bounded. 
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2.2 Job arrival rates 


We assume that the time between two consecutive arrivals of jobi to thread[n\ is a random variable 
I[i,n ] (inter-arrival period), where i € [1, J] and n £ [1,1V]. We define the job arrival rate, Aj )Tl , in 
which jobi arrives at the system that then places jobi in a (first in, first out) queue, Q(thread[n\), 
where Ais a positive real number. The inter-arrival period of I[i,n] follows an exponential 
distribution, Exp(\i, n ). Note that this is a common way to model arrivals, e.g. [7]. As soon as 
thread[n ] becomes available, the system assigns to thread[n\ the job that is in Q(thread[n]y s top. 
This work focuses on systems that can be in an equilibrium for which the number of pending jobs 
in Q(thread[n]) is bounded. 


2.3 Work cycles: demand, supply and release 


The thread work cycle, cycle(thread[n\, jobi), refers to the events that occur during the period 
that starts when the system assigns jobi to thread[n] and ends immediately before the next as¬ 
signment of any job to thread[n]. It starts with the event Oi(thread[n ]) in which the system 
assigns jobi to thread[n]. It also includes the events in which: thread[n] demands (requests) ac¬ 
cess to object[k\, denoted by 5i(thread[n], object[k]j), the system supplies (provides) access to 
object[k], denoted by ai(object[k], thread[n ]), and the event in which thread[n] releases object[k\ 
denoted by 4>i(object[k],thread[n]). For simplicity, we refer to the sequence of these release events 
$>i(thread[n}) = ( 4>i{object[k\}, thread[n ]), ..., <f>i(object[kc), thread[n ])) as a single event and as¬ 
sume that thread[n] releases all its acquired objects instantaneously and immediately after the 
operation time, O,;, which is a random variable. Immediately after the event &i(thread[n ]), the 
thread work cycle starts a (possibly zero length) idle period, before the system assigns the next 
(and possibly different than the previous) job to thread[n\ so that the next work cycle begins. 

We assume that events are instantaneous and mark them as points 


on a thread’s work cycle (Figure 1). Note however, that between a 
supply event and a demand event (as well as the last supply event 
and the release event), there is a random length period, i.e. the (ran¬ 
dom) acquisition period A (and the operation time Oi, respectively), 
which refers to scheduling uncertainties. Hence, we denote thread\n\ s 
work cycle due to jobi as cyde(thread[n],jobi) = (■ <Ji{thread[n \), 
Si(thread[n], object[k{\), Oi{object[k \], thread[n]), ..., Si(thread[n], 
object[k(\), <Ji{object[k(\, thread[n]), &i(thread[n\)). 



2.4 Subpaths and acquisition graph 


cycle 


Let s (source) and d (destination) be (possibly consecutive) items on a 
path. We define the set e(s,d) = {(s, •, d) | 3 jobi '■ pathi = (•, s, •, d, 

•)} of all subpaths between s and d, where • denotes a finite, possibly empty, item sequence. The 
acquisition graph, Sf = (V,E), is a simple directed graph, where V = {thread[ 1], ..., thread[N], 
object[1], ..., object[M ]} are the system items (Figure 2). The edges E = {(s, d) | 3 jobi : pathi = 
(•, s, d, •)} are two consecutive items on a path. 
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2.5 Conditional and consecutive events 

We consider an event that occurs at item d (destination) in condition to an 
event occurrence at item s (source), where (•, s, d, •) is a subpath of jobi and 
both events belong to the same work cycle of jobi that thread[n\ carries out. 

Conditional demand and supply events. Denote by <5i, n (d|s) 

= ( 6i(thread[n ], d) \ cr,(s, thread[n ])) the conditional (demand) event, 

Si(thread[n ], d), in which thread[n] requests access to object d immediately 
after the supply event, eq(s, thread[n]). Note that the event cq(s, thread[n ]) 
may refer to: (1) access to object s, or (2) jobi 's assignment to s = thread[n] 

(Figure 1), i.e., oi(thread[n\) = ai(thread[n],thread[n\). E.g., 8i^ n (object[j] 

| object\k\) denotes the conditional demand event (Si(thread[n\, object[j ]) | 

<Ji(object[k\, thread[n\)) in which thread[n\ requests access to object[j] im¬ 
mediately after gaining access to object[k\, where k £ [1, M — 1], j £ ( k, M] and n £ [1, TV]. Another 
example is the case Si t n(object[j] \ thread[n \), where the conditional demand event ( Si(thread[n\, 
object[j ]) | ai(thread[n], thread[n \)) refers to thread[n \s request to access job (s hrst object, 
object[j ], immediately after the assignment of jobi to thread[n}. In a similar manner, denote by 
(l>i,n(d\s) = (4>i(d, thread[n ]) | <r,;(s, thread[n]j) the conditional (release) event, in which thread[n\ 
releases object d at event 4>i(d, thread[n ]) that occurs after the supply event, <r,;(s, thread[n ]) and 
at the same work cycle (we will mainly use (f>i tn (s\s)). 

Events of arbitrary jobs and threads. Sometimes we consider an arbitrary jobi that an 
arbitrary thread[n} carries out. We then write d(d|s), a(s ) and <j(d\s) instead of S,,,(d\s), er...(s), 
and respectively, </>.,. (d\s) when referring to events from the sets n (d|s) : i G [1, J\,n G [1,IV]}, 
Wi,n(s) : i G [1, J\,n G [1,7V]}, and respectively, {(j>i, n (d\s) : i G [1, J],n G [1, AT]}, where i G [1, J] 
and n G [1, TV]. Given subpath (•, l\, ..., Ij., •), denote by S(£k Ki, • • • ,£k- 1 ) the occurrence of 
5(£k\£k-i), which happens immediately after S(tk-i\tk- 2 ), ■ ■ •, A(^ 2 |^i), where k G [1, M\. 
Consecutive events. Let i, j G [1,J], k, £ G [1,M} # n, n' G [1,TV], 5i tn (object[k]\s) and 
Sj, n '(object[(]\s). We say that the event Si tn (object[k]\ s) occurs consecutively after the event 
5j tn ' (object[t] | s), when 5j^ n ' (object[£] \ s ) = ( Sj(thread[n '], object[i \) | <7j(s, thread[n'])) is the first 
conditional demand event, that includes the supply event <Jj(s, thread[n'}), to occur after the con¬ 
ditional demand event Si tn (object[k\ \ s) = (Si(thread[n], object[k \) | cr.i(s, thread[n])), that includes 
the supply event cq(s, thread[n]). 

2.6 Pairwise states and request probabilities 

The definition of the studied equilibria (at the system level) is based on item-level definitions that 
consider f^’s edges, (s, d) G E. We present a definition of the (pairwise) state, c[s, d], which considers 
the delay, blocking and inter-demand periods that are related to the edge (s, d) and its conditional 
events. These periods refer to the time it takes threads to request access to object d, and release 
it subsequently (after the acquisition of item s) as well as the time between such requests that are 
made by (possibly) different threads. Moreover, when estimating the value of the pairwise-state, 
c[s,d], we need to consider the probabilities that are related to the edge ( s,d) and its conditional 
events. 

Pairwise states. We refer to (s,d)’s pairwise state c[s,d] = s.(T[fc], D[k], B[k}) as the tuple 
that includes the request completion rate, request delay, and respectively, blocking period with 
relation to the events <5(<V|s), where d = object[k] is an object and k G [1,M]. The (s,d)’s request 
inter-demand period, s.T[fc], refers to the period between the consecutive events, (Si(thread[n\, d) 



Figure 2: An acqui¬ 
sition graph, ^ 
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Figure 3: (s, d)’s inter-demand period, s.T[k], d = object[k\, i.e., the period between two consecutive 
S(d\s) events. 


| thread[n})) and (. 5j(thread[n '], d) \ <7j(s, thread[n '])) (Figure 3), where i,j £ [1, J] and 
n,n' £ [1,7V], Notice that we can estimate the throughput, 1 /s.T[k\, which is the number of S(d\s) 
requests per time unit. Moreover, we can consider the expected throughput, 1 /E(s.T[k]), where 
E{s.T[k\) is the expected value of s.T[k]. Furthermore, (s, d) ’s delay, s.D[k ], refers to a period that 
starts on the event S(d\s) in which a thread requests access to object d (immediately after gaining 
access to item s) and ends upon the event (j>(d\s) in which that thread releases object d (during 
the same work cycle). In addition, (s,d)'s blocking, s.B[fc], is the fraction of (s,d)'s delay s.D[k ] in 
which a thread blocks other threads from gaining access to object d , i.e., the period between the 
event (<Ji(d, thread[n])\<Ti(s, thread[n\)) in which t.hread[n] gains access to object d, and the event 
</>i(d, thread[n\) in which thread[n} releases d (Figure 4). Note that each of c[s,d]’s three elements 
is a random variable (for which maintaining the first three moments provides sufficient accuracy). 

Pairwise request probabilities. When estimating the pairwise-state c[s, d], 
we use the (pairwise) request probability, R(s,d), of events that are related to 
Sf’s edge (s, d) to occur. When given the history of system events, we define the 
probability of the conditional demand event <f(<i|s) to occur immediacy after the 
supply event a(s). We also consider the case in which the system does not have 
access to this information. In that case, we estimate R(s,d) while assuming 
that S(d\s) occurrence depends only on the system parameters, i.e., N, M, 

{jobi}ie[i,J] and {Ai,n}ie[i,J1,ne[i,iV]) rather than requiring the availability of 
the event history. 

For a randomly chosen work cycle that includes the event, <r(s), of a thread 
gaining access to item s, we define fl(s) = (<5(d|s) : (s, d) £ E is an edge in 
( 0 (s|s)} as the probability space of the possible events to occur immediately 
after er(s). Moreover, R(s,d ) and R(s,s) are the probabilities of fl(s)’s events 
<5(d|s), and respectively, </>(s|s). Namely, R(s,d) denotes the probability of a 
demand event, 5,(thread[»\,d) to occur immediately after the supply event, 
a,(s,thread[» ]) and in the same (randomly chosen) work cycle. Moreover, 

R(s,s) denotes the probability of a release event, thread[»\), to occur immediately after its 
related supply event, cr.(s, thread[»\) and during the same (randomly chosen) work cycle. Note 
that R(s, s)’s definition requires that s is the last object for the thread to gain access to during that 
work cycle. 

Since R(s, d) and R{s, s ) depend on the history of events, we further detail their definitions by 
using the notations R t (s,d) and R t {s,s). We restrict the (random) choice of the work cycle to the 
time interval t = [t start , t en d] and assume the awareness of all events that occurred in the system 



Figure 4: (s,d)'s 
delay s.D[k] and 
blocking s.B[k] 
start from <5(d|s), 
and respectively, 
cr(d|s), and both 
end at <p(d\s), 
d = object[k\ 
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during that period. For the time interval t = [t start, t en d\, we define Rt{s,d) = at{s,d)/r]t{s), to be 
the number of £(d|s) occurrences, over the number of o(s) occurrences and Rt{s,s) = f3t(s) / rjt{s) 
to be the number of </>(s|s) occurrences, over the number of cr(s) occurrences, where fftX denotes 
the number of event X occurrences in t = [t start, tend]- Moreover, at{s,d) = T,i tn #tSi,n{d\s), when 
s = object[j] : j £ [1 ,M — 1], and ott{s,d) = Tii#t5i,n(d\s), when s = thread[n] : n £ [1,1V], the 
number of <5(d|s) events that occurred during t = [t s t a rt , t en d\■ Furthermore, (3t{s) = E*, n (s|s) 
and rjt(s) = T,i^ n f) t (Ji(s,thread[n\) denote the number of <(>(s|s), and respectively, a(s) events that 
occurred during t = [t stor t , tend]■ Note that profiling tools can be the basis for estimating R t (s,d) 
and R t (s, s). We also propose an estimation of R(s, d) and R(s, d) (Section 5) for the case in which 
these probabilities depend only on TV, M, {jobi}t e nn and {Aj ) „}j e r li n ,jv] (and thus t's history 
of events is not required to be available). 

We define the request probability matrix R to be a (N + M) x (N + M) row stochastic matrix. 
The matrix R has a block form, where Rn,n is an N x N zero matrix, Rn,m = ( R{thread[n \, 
object\j])) ne \ i,jv],je[i,j] is an ( N x M) matrix, Rm^n is an M x N zero matrix and Rm,m = 
(. R(object[j ], object[k]))j t k£[i,M] is an (upper triangular) M x M matrix, i.e., 


R = 


Rn,n 

Rn.m 

Rm,n 

Rm,m 


( 1 ) 


2.7 Item inter-demand period 

We refer to object[k\ s inter-demand period , T 0 bject[k\, as the period between two consecutive condi¬ 
tional (demand) events for accessing an object in {object[l\ : £ £ ( k,M )} immediately after gaining 
access to item object[k] by two, possibly different, threads, where k £ [1, M— 1]. Namely, T 0 bject[k\ is 
the period between 6(object[j]\object[k}) and the successive conditional event 6(object\j']\object[k]), 
where j,j' £ ( k , M], We refer to thread[n\ s inter-demand period , TthreadWi] as the period between 
Si(thread[n], object[i-\\) and the demand event Sr (thread[n], object\i ^]), where i,i' £ [1, J] and 
o 6 ject[*i], object\i'i\ are the first objects in the object vectors of jo 6 ,;, and respectively, jobr , where 
thread[n] carries out jobi and jobr consecutively. For example, we can use l/ E(T threa d[n]) f° r 
estimating the number of job completions of thread[n\ per time unit, where E(T S ) is 7^’s expected 
value. In case s = object[d ], 1/E(T ob j ect [d]) estimates the number of job completions that include 
object [d] per time unit. 

We refer to K^ n = Ci tn /Tthread[n\ as jobi s completion rate on thread[n], which is the fraction of 
thread[n\ s inter-demand period, TthreadWi] j due to jobi , where i £ [1, J], n £ [1,1V]. We use Cj, n 
as a constant for which Wi £ [1,1V] : Ci, n = 1, and the weights Ci, n are a function of the job 

arrival rates to thread[n ], for example Ci )U = X,n / (EjLjA^n). 

2.8 Shared-object system equilibria 

For a given system, ^(Sf) = {c[s, d]}( s ,d)e£; is the system state (set), where Sf = (V, E) is the 
acquisition graph. For a given ^(Sf), the set r(Sf) = {Titem}itemev\{object[M]} is the inter-demand 
period of the system. 

Suppose that a system is in a state in which the job arrival rates are equal to the job completion 
rates, i.e., Mi £ [1, J] Vn £ [1,1V], A = Ki >n . We say that = (c*[s, d]}( s ,d)e.E> is the 

shared-Object System Equilibrium (OSE). Given the respective inter-demand period of the 

system is r*(Sf) = {T£ em } it emev\{object[M]}- For a given e > 0 and an OSE we say that 

the system state ip(&) is an e-OSE when V item £V \ [object[M]}, T.* tem £ T*(^),Tit e m G r(Sf) : 
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Estimating R(s, d) (Section 5) We define the following characteristic functions with val¬ 
ues in {0,1}: startsi((si , ..., se)) <tt> objSi = (si, .. se, •), include Si((si, .. se)) 
<S=> objSi = (•, si, se, •) and endsi((si, ..., se}) objSi = (•, si, .. se), where 
(si,...,se) is a vector of objects, objSi is the object vector of jobi and i £ [1, J]. When 
s = thread[n] : n £ [1, iV] and k £ {1,2}, • startSi((k)))/(Yi\i !n ), and respectively, 

(£,:Aj in • include si{{\, 2)))■ includes j((l})) approximate R(s,d). Moreover, Vfc £ {1,2} 
: (Yi tn Ai tn ■ endSi((k)))(Yi jn Xi tn ■ includesi({k))) approximates R(s,s). 

Estimating s.B[k} (Section 6 ) Let s £ {thread[n], object[ 1]}. Observe that the blocking 
periods s.S[2] are only due to paths that finish in object[2 ] and thus there are no dependencies 
for their estimation, i.e., s.B[ 2] = A + R(s, object[ 2]) • R(object[2],object[2]) ■ f St0 bject[ 2 ]- On 
the contrary, the blocking time of a thread’s demand to object[ 1], say thread[n].B[ 1], depends 
on the possibility of thread[n] to demand object[2] and the respective delay, object[l].D[2]. That 
is, thread[n].B[ 1] = A + R(thread[n \, object[l}) ■ R(object[ 1], object[1]) • fthread[n],object[i] + 
R(thread[n], object[ 1]) • R(object[ 1], object[ 2]) • object[l].D[2], where f St d is the average of job 
operation times for jobs with paths (•, s, d) weighed by the probability, R{s , d) • R{d , d), of such 
events to occur. 

Estimating Td (Section 7) Let arrivals(d) be the set of items s, such that a conditional 
demand event <5(d|s) can occur. Object d's inter-demand period is Td = ^searrivais(d)(^{s, 
d) ■ {T s + A + R(s, d) ■ R(d , d) • f Std ))/Y sear rivais(d) w(s, d), where the weight is w(s, object[k ]) 
= R(s , object[k ]) • s.T[k], arrivals(object[ 1]) = thread , while arrivals (object [2]) = threadU 
{object{ 1]} and thread = (thread[ 1], ..., thread[N]). 


Figure 5: Estimating R(s,d), s.B[k] and Td when M = 2. 


| T* t em — Ttem\ < £• Namely, the corresponding values of each item in r(5f) and t*(S?) differ by less 
than e. 

Note that a system cannot always reach a state that satisfies the OSE conditions, and therefore 
an e-OSE. Equilibria are unreachable when there is an item with a blocking period that is longer 
than (or equal to) the inter-arrival time of demand events to that item. For example, when the 
inter-arrival time of object requests is less or equal to the blocking period of that object. Note that 
in that case, the item’s queue is increasing continuously. 


3 The Solution Outline 

We consider the case in which the job arrival rates can become equal to the job completion rates. We 
study how the system satisfies the OSE conditions both in exact and approximated manners. We 
propose a procedure for finding the approximated equilibria, i.e., e-OSEs. This procedure considers 
Sf’s paths, (•, s, d, •), where s is a system item, d = object[k\ and k £ [1, M\. 

3.1 Estimating c[s,d\ and R(s,d ) 

We illustrate a solution for the case of M = 2 objects and N threads (Figure 5) and outline the 
general case solution. 




The pairwise request probabilities. The pairwise state, c[s,d], and request probability, 
R(s,d), are related to the conditional events, <5(d|s). When estimating the value of the pairwise 
state, we first need to estimate the probability for d(d|s) to occur. Our approach considers both 
the case in which R(s , d) and R(s, s) are given and the case in which they depend only on the 
system parameters (Section 2.6), i.e., N, M, {jobi}i e \in and {Aj >n }j e h j^neNAT Using the latter 
assumption, we estimate R(s, d) by the sum of job arrival rates for which threads demand access 
to d after the supply of item s divided by the sum of job arrival rates due to which supply events 
for item s occur (Figure 5). Moreover, we estimate the probability R(s, s ) by the sum of job arrival 
rates such that their object vectors finish with s divided by the sum of job arrival rates due to 
which supply events for item s occur. 

The thread’s blocking periods. Given the pairwise request probabilities, R, we estimate the 
blocking period, s.B[k] (Section 6 ), where d = object[k\. Item d’s blocking period depends in a 
recursive manner on the delay, cf. thread[n\.B[ l]’s dependency on object[l].D[2] in Figure 5, which 
in the general case appears as s.I?[fc]’s dependency on object[k].D[k'] : k' £ (k, M] in Lemma 4. This 
dependency considers the delay that the thread experiences when further acquiring the remaining 
objects t on the path (•, s, d, t, •), which the thread needs for completing its job, where t = object[l] 
and (. € (k, M], Moreover, s.B[k] depends on the job completion period , f St d, due to jobs with paths 
(*,s, d). We refer to (s, d)’s (job) completion period, / S)( z, as the time between the event cr(<i|s), in 
which a thread gains access to object d (after acquiring item s), which is the job’s last object, and 
the event <(>(d|s), in which that thread, immediately after executing the job operation, releases all 
objects (including d) that it had acquired during the work cycle that includes this two events (e.g., 
fitem [i*;_i], item[ik] is the period between <7i k and dq, see Figure 1). We detail the exact way in which 
such forward dependencies exist while the system satisfies the OSE condition (Section 6). 

The item inter-demand period. An item-level balance also exists and it is similar to the one 
that the system keeps for its threads (when satisfying the OSE conditions). Namely, the incoming 
rate of requests (demands) to access object d, has to balance with the inter-demand period, Td, 
which is the time between two consecutive 8{x\d) and 8(y\d) events (where x and y are two, possibly 
different, objects). Note that Td depends on the rate of requests (demands) to access d due to jobs 
with (•, d , d !, •) paths, as well as, (•, d) paths. Thus, we estimate Td, as the sum of the inter¬ 
demand period of every item s, T s , and the job completion period A + R(s , d) ■ R(d, d) • / Si( j, times 
a weight that depends on the pairwise inter-demand period s.T[fc] plus the request probability 
R(s,d), for every item s, such that ( s,d = object[k ]) is an edge in Sf. Note that d’s inter-demand 
period, Td, has a backward dependency, i.e., it depends on the inter-demand periods s.T[k] and T s , 
where (s,d) is an edge in ^ and d = object[k\. We show the exact manner in which the system 
maintains this balance in Figure 5 for the case of M = 2 and in Section 7 for the general case. 

3.2 Resolving interdependencies 

Thus far, we have noticed several dependencies. For example, there are forward dependencies in 
which (s,d)’s blocking period, s.B[k], depends on t’s delay, where d = object[k ], t = object[t\ and 
t £ ( k,M ]. Moreover, there is a backward dependency in which (s,d)’s inter-demand period, Td, 
depends on the summation of pairwise inter-demand period, s.T[k], for any path (•, s,d, •). Note 
that more dependencies exist. The definition of the pairwise state c[s, d], implies, for example, 
that (s,d)’s inter-demand period, s.T[k], depends on s’s inter-demand period. Moreover, d’s delay 
depends on its blocking, s.B[k\, and the (s,d)’s inter-demand period, s.T[k]. Note that these 
pairwise state variables are inter-dependent due to blocking. We show a way to resolve these 
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interdependencies by representing the thread work cycles as a subsystem in a way that is not 
subject to blocking and yet preserves the interdependencies that are related to the paths (•, s, d, •) 
(Section 8 ). This approach for resolving forward and backward interdependencies is the basis of 
the proposed procedure for finding approximate equilibria. 

3.3 Finding approximate equilibria 

We compute an approximated equilibrium, e-OSE, when such is reachable. We propose a procedure 
that always halts (Algorithm 1 presents the solution sketch and we detail the entire procedure in 
Section 9). It returns the system in an e-OSE state whenever the job arrival and completion rates 
become equal, or indicates that the system cannot be in a state of an OSE. 

The procedure starts with a system state that represents the case in which all queues are empty 
(line 4). It then estimates the state of a system in which threads can block one another, and the 
delay grows as more requests are pending in the queues. The procedure works in iterations and 
decides when to stop using the system inter-demand period, {litem} itemev\{object[M]} , he-, it stops 
whenever there is no item £ V \ { object[M ]} for which the change in 7 item is greater than e since 
the previous iteration (lines 6 to 16). 

The procedure repeatedly improves an e-OSE estimation until the system state satisfies the con¬ 
ditions of an approximated equilibrium. It deals with interdependencies using alternating backward 
and forward iterations (lines 7, and respectively, 12). Namely, we resolve the forward dependencies 
in which (s, object[k])' s blocking period, s.B[k], depends on object[tys delay by iterating backwards, 
where t £ ( k , M\. This backward iteration starts from k = M and counts downwards. Moreover, it 
can estimate s.B[k] (in a system that its state satisfies the equilibrium conditions), because all of 
(s, object[k]y s forward dependencies can be resolved. Similarly, we can use forward iterations for 
resolving backward dependencies with respect to d’s item inter-demand period, T 0 bject[k \, because 
all of object[ky s backward dependencies are resolved. 

This loop also updates the thread inter-demand periods, i.e., the time between job completions 
(line 11 ), and exits when no item’s inter-demand period changes by at least £ between every two 
iterations (line 16). Together with the estimation of Tthread[n] > the procedure checks whether the 
OSE condition is violated (Section 7), i.e., if the arrival rate, E/^A of jobs to thread[n], is greater 
or equal than 1 /blocking(n), where blocking(n ) = A + Y,%L 1 R(thread[n], object[k}) ■ thread[n].D[k\ 
is the average time it takes to complete a job for thread[n\, and 1 /blocking[n) is respective rate 
for blocking{n). In case the OSE condition is violated, the loop breaks and the procedure returns. 
Each iteration takes 0(M 2 ■ N 4 + M 3 ) time (see Section 9.5). 

4 Background knowledge 

Our solution uses tools from queueing networks [5]. Although queueing theory celebrated results 
provide closed forms for single queues, e.g., M/M/c, M/G/l [1], and queueing networks, e.g., 
BCMP [3], Gordon-Newell [9], closed form results are far from been the common case. Specifically, 
there are no relevant closed-form results that can be used for systems like ours in which a thread can 
block other threads for a non-exponentially distributed period. Ramesh and Perros [18] consider a 
message passing system of multi-tier server networks in which processes communicate iteratively via 
what is known in the system community as synchronous I/O (and sometimes called blocking I/O). 
Our solution requires resolving interdependencies. We use the thread work cycle for showing that our 
subsystems (Section 8 ) can represent these interdependencies. We then show that Ramesh-Perros 
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Algorithm 1: Finding an £-OSE (procedure sketch) 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 


Input: M, N, {jobi} ie[li j j, {Ai,„} ne [i iJV ],i6[i,J]> R; 

Output: (thread[l,N],object[l,M - \\, {Tv} v -.v^obiect[M\)\ 

Macro: S^{k) = { thread } U {{ofcject[f]}|f G [1, k — 1]}; 

Start by supposing that all queues are empty; 

repeat 

let prevSet 4— item inter-demand periods; 
for k = M to 1 (* backward iteration *) do 
foreach VSk G ^(fc),Vs G Sk do 

Estimate s.T[fc], s.D[k] by resolving the (s, o&j'ect[fc] (-subsystem (Section 8); 
Estimate s.S[fc] (Section 6); 


11 

12 

13 

14 

15 


foreach n G [1, N ] do Estimate 7thread[n] and upon OSE condition violation call return(‘no 

OSE’); 

for k = 1 to M — 1 (* forward iteration *) do 
foreach VSfc G d^(k),\/s G Sk do 

j_ Estimate s.T[fc], s.D[k] by resolving the (s, object[fc](-subsystem (Section 8); 

Estimate T 0 bj ec t[k] (Section 7); 


16 until the system has reached equilibrium (test for e-OSE using the inter-demand period of every 
item)-, 

17 return (thread[ 1, N], object[ 1, M - 1], {Tv}v:v^object[M\)\ 


subsystems [18] can analyze our subsystems and resolve their interdependencies iteratively. We find 
e-OSEs in a similar manner. Namely, we use a framework proposed by Baynat and Dallery [4] for 
estimating the system state, in a similar manner to Ramesli-Perros [18]. The authors of [18, 4] 
demonstrate the convergence of their iterative methods via numerical experiments. Baynat and 
Dallery [4] show that each iteration has polynomial running time, which is 0(M ■ IV 4 ) for the OSE 
case (Lemma 9). 

In the remainder of this section we present the stochastic process through which we calculate the 
probability of a thread to be idle, using the job arrival process and the time that it takes a thread 
to complete an arbitrary job (Section 4.1), as well as, a version of the Baynat-Dallery framework 
adapted to shared-object systems (Section 4.2). 

4.1 Idle thread probability 

We analyze the stochastic process in which jobs arrive in a shared-object system and are assigned 
to a thread, thread[n \, whenever a previously assigned job is completed (departure) or the thread 
is idle. This process is characterized by the inter-arrival time of jobs to thread[n\, I n and the 
respective blocking period B n , during which thread[n\ carries out the job. We explain how an 
existing method can help us to obtain the probability of thread[n] to be idle, u n (unoccupied). 

We define a Markov chain that has the structure of a quasi-birth-death process (QBD) [8], using 
/„ and B n . We consider general distributions of arrival I n and departure B n stochastic processes, to 
which we match Coxian-2 distributions, using their first three moments [2]. We consider (Section 2) 
the arrival process of jobi to thread[n} to follow an exponential distribution I[i,n\ ~ Exp(\i jn ), 
as in [7]. Therefore, we can assume that I n follow an exponential distribution with parameter 
E/ = iA,; in , where J is the number of the system’s jobs (as in [8]). Moreover, we obtain the first three 
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moments of B n by the moments of the thread[n]' s inter-demand period, Tthread,[n\ > that we calculate 
in Section 7. By applying moment matching to the first three moments of /„ and B n , we obtain 
the arrival, and respectively, departure rates of the continuous-time Markov chain. Thus, we can 
define a quasi-birth-death (QBD) process that considers the growth and decrease in the number of 
pending jobs to be assigned to thread[n\ , where each state determines the arrival and departure 
state of the respective Coxian-2 distribution. 

We apply the Matrix Geometric Method (MGM) [16] to find the steady-state probabilities for 
each state of thread[n .], due to the Markov chain’s QBD structure. We then obtain u n directly since 
it equals the sum of the QBD process’ probabilities for having no pending jobs, i.e., u n = u n j +it ri) 2 , 
where Ui )Tl is the steady-state probability of having no pending jobs and the arrival of a new job 
being while on phase i of the Coxian-2 distribution, i £ {1,2}. The MGM is an iterative method 
and its running time is in 0(1 mgm ■ m 3 ), where to is the maximum number of states in each QBD 
level and Imgm is the number of iterations until the method converges. Since for our purposes, 
m is a constant, e.g., to = 4 in the modeling described here, Imgm is a small number and the 
method converges quadratically, we assume that the running time of the MGM is (practically) 
constant. We base our assumption that Imgm is a small number on an example that Latouche and 
Ramaswami [13] give. They argue that in practical settings, Imgm is a small number, considering 
the case of Imgm > 40. Here, J > 10 12 , where J is the number of jobs. 

4.2 Baynat-Dallery framework 

As a background knowledge, we discuss a variation on Baynat and Dallery’s framework [4] that 
we adapt to the context of shared-object systems (Algorithm 2). The BDFQ function denotes 
our adapted version of Baynat and Dallery’s framework. For every s £ Sk, this function takes a 
contention subsystem, CS(Sk,k), which is the tuple (H(Sk,k), (7 Z s ) s& s k , (B s ) s ^s k ), as an input 
and returns an estimation of the delay s.D[k] and inter-demand period s.T[k], Namely, (s.T[/c], 
s.D[k]) sGSk = BDF(CS(Sk , k)). The solution of Baynat and Dallery is based on iterative approxi¬ 
mations of the demand arrival rates and request completion rate to object[k\ with the ones in the 
subgraph 'H s {Sk,k) = (V s ,£ s ), and vice versa, until, for every s £ Sk , their absolute difference is 
below a given threshold. 

We complete this section with a detailed explanation of Algorithm 2. The procedure starts by 
an initialization phase (lines 7-9), which is followed by a repeat-until loop (lines 10-25) and the 
output calculation (lines 26-27) before returning the output (line 28). 

Variables. Lemma 7 of Appendix 8.3 shows that a contention subsystem (Section 8.1) represents 
the dependencies among the threads in a shared-object system with respect to its state. Let s £ Sk 
denote a thread, if Sk = thread , or an object, if Sk = {object[j}}, where j £ [l,fc — 1]. For item 
v 6 V s , we define X s (d), B s (v) and C s (v) to be item ids the inter-arrival time I s (v), blocking period 
B s (v), and respectively, inter-demand period C„(v). With respect to the BDF () function, I s (v) is 
the time between two demand events to item v, B s (y) is the blocking period of an arbitrary demand 
event to v, and respectively, C s (v) is the time between two release events on v. Note that when s is 
a thread ( Sk = thread), I s (s) is the time between two object release events by s, B s (s) is the time 
from an object release event until the next job completion by s, and respectively, C s (s) is the time 
between two job completions by s. Baynat and Dallery approximate X s (i>), B s (v) and C s (v) using 
exponential distributions with parameters (rates) 7 s (i’), /i s ( v), and respectively, v s {v). Note that 
these random variables depend only on s and v, due to the definition of the contention subsystem 
(Section 8.1) and Lemma 7 of Appendix 8.3. The BDF() function estimates 7 s (i’), /-i s (v) and v s {v) 
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and uses them to compute s.D[k\ and s.T[k] for every s £ Sk- 

We define the subgraph / H s (Sk,k) = (V a ,£ a ) of / H(Sk,k ) (Section 8.1). Note that when Sk = 
thread, thread s = thread[n\ requests to access only to items in / H s (Sk,k) , s subgraph. Similarly, 
when Sk = {object[£}}, a thread that has access to s = object[£\ requests to access only to 'H s (Sk, k)'s 
items, where £ £ [1, fe— 1]. Baynat and Dallery treat these subgraphs as Gordon-Newell networks [9, 
6 ]. The BDF(y s repeat-until loop (lines 10-25) alternates between computing 7 s (u), p s (u) and 
v s (v) for the Gordon-Newell network defined by the subgraph R s (Sk,k), for every s £ Sk and the 
same values for every individual item v £ V. The iterations stop when for two consecutive loops 
there is no s £ Sk and v £ V„, such that the values of n 3 (v) differ more than a given e (line 25). 
Initialization phase. The BDF() function initializes n s (v) with 1/E(B s [v)) (lines 7-8) and com¬ 
putes the steady-state probabilities of a thread to demand or have access to item v, after the supply 
of item s. It does that through the function stationary^), which takes the stochastic matrix 1Z S as 
an input. The function stationary () outputs the steady state vector, steadyStateProbabilities s , 
which has the size of |V S |. Moreover, steadyStateProbabilities s satisfies the equations tt ■ 1Z S = n 
and r,; = 1 (line 9). 

The repeat-until loop. The BDF() function’s repeat-until loop calculates J s (v) (lines 12-17), 
v s (v) (lines 18-22) and fi s (v) (lines 23-24). We calculate the Gordon-Newell normalizing constant 
(line 13) and the marginal probabilities of Gordon-Newell (lines 14-16). Using these marginal 
probabilities, we calculate 'y s (v) for every s £ Sk and v £V 3 (line 17). 

We find the marginal probability of an item to be idle through the idleProb() function (lines 18- 
21) and then calculate a s (v) (line 22) for every s £ Z{ v), where Z(v) = {s £ Sk \ v £ V s } (line 5). 
The idleProbf) function calculates item v’s marginal probability to be idle through the underlying 
Markov chain of a multi-class queue with exponential arrivals ( 7 s (u) for every s £ Z(v)). It also 
calculates the blocking periods (/z s (i>) for every s £ Z(v)). Note that the queue length is limited 
by the maximum number of pending demands (TV when Sk = thread and 1 when Sk = {object[j]}, 
j £ [1 ,k 1]) [4, 5]. 

The calculation p s (u)’s new estimates (lines 23-24) happens before the next iteration. In order 
to check the convergence condition, each iteration begins with storing in the oldValues variable the 
last estimations of y s (v) for every s £ Sk and v £ V s (line 25). 

BDFO's output. We estimate s.D[k] through the delay in object[k\ s queue in the contention 
subsystem (line 27). Moreover, we obtain an estimation of the inter-demand period s.T[k\ through 
1 / 7 S (object [A;]) (line 27). The BDF () function returns the output in line 28. 


5 Request probabilities 

The fact that the pairwise request probabilities depend on the arrival rates of the corresponding 
jobs is the basis of our estimation of R(s,d) (Lemma 1) as in the network approximations [8], 
where (s,d) is an edge in <$. In Lemma 2 we prove that our estimations of R(s,d) and R(s,s ) 
define indeed a probability, i.e., for any item s, R(s,s ) + T,d 7 t s R(s,d) = 1. This implies that the 
probability matrix R, which contains the estimates of R(s,d) and R(s,s), is a stochastic matrix. 

Lemma 1. Equation 2 and Equation 3 approximate R(s, d), when s = thread[n] : n £ [1, AT], and 
respectively, s = object[j] : j £ [1,M — 1] , see Figure 5 for the definitions of the characteristic 
functions startsf), includesQ and endsf). Moreover, for any object s, Equation j approximates 
R{s, s). 
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Algorithm 2 : The BDFQ function for estimating delay and pairwise inter-demand period 
through a contention subsystem 

1 Input: CS(Sk, k) = (' H(S k , k), (K s ) seSk , (. B s ) seSk ); 

2 Output: (s.T[fc], s.D[k]) s< z Sk ; 

3 Macros: 

4 converged(jprev,curr) = 0y s (v) £ prev, y' s (v) £ curr : \p, s (v) — p.' s (v)\ > e); 

5 Z(v) = (s G S k \v £ V s }; 

6 begin 

7 foreach s £ S k do 

8 |_ foreach v £ V s do p s (v) £- l/E(B s (v)); 

9 foreach s £ S k do (steady StateProbabilities s ( i>))„ g y s 4 — stationary( 1 Z S ); 

10 repeat 

n let oldValues = (fi s (v)) se s k ,veV, 

12 foreach s £ S k do 

13 GordonNewellConstants 4— Z, ve Q^steadyStateProbabilities s (v) / y s (v); 

14 foreach v £ 14 do 

is subgraphM arginalProbs 8 , v ( 1 ) 

steadyStateProbabilities s (v)/(ii s (y) ■ GordonNewellConstants); 

16 subgraphM arginalProbs s , v ( 0 ) 4— 1 — subgraphM arginalProbs SiV ( 1); 

17 7 s(v) 

fi s (■ v ) • ( subgraphMarginalProbs s , v ( 1)/ subgraphMarginalProbs SyV (0 )); 

is foreach v £ V do 

19 foreach s £ Z(y ) do 

20 itemMarginalProbs s<v ( 0) £- 

idleProb(v, s, { 7 s(«)}ses fe , {Ms(^)}ses fc , (^s)sesj; 

21 itemM arginal Probs StV (l) £- 1 — itemMarginalProbs s , v ( 0 ); 

22 foreach s £ Z(v) do 

v s (v) 4- 7 S (v) ( item.MarginalProbs s , v (0)/ itemMarginalProbs s , v (1)); 

23 foreach s £ S k do 

24 |_ foreach v £ 14 do p s (v) 4 — v s (v); 

25 until converged(oldValues, {p s (v)} S £s k ,vev); 

26 foreach s £ S k do 

27 let (s.D[k], s.T[fe]) = 

(1/(p s (object[k]) ■ subgraphMarginalProbs s o bject[k\{ 0)), l/7s (object [fe]); 

28 return (s.T[fc],s.£)[fc]) se 5 fe ; 


( 2 ) 

(3) 


R(thread[n], d ) 


(EiAi.n • startsi({d))) 


R(object\j], d) 


(Sj Xi,n) 

EiAi.n • includesi((object\j], d)) 
SiAi, n • includesiUobjectlj])) 
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R(s, s) 


(4) 


{'Ei,n\i,n-endsi((s))) 

(Ei,n\i,n-includesi((s))) 

Equation 2 estimates R(thread[n], d) by the sum of arrival rates of jobs that start with object d 
to thread[n ], divided by the sum of the arrival rates of all the jobs that are assigned to thread[n]. 
Equation 3 estimates R(s, d) by the sum of arrival rates of jobs that include the subvector (s, d) 
in their object vector (to any thread), divided by the sum of arrival rates of jobs that include 
the subvector (s) in their object vector (to any thread), where s = object[j] and j £ [1,M — 1]. 
Equation 4 estimates R(s, s) by the sum of arrival rates of jobs that their object vector ends with 
the subvector (s) (to any thread), divided by the sum of arrival rates of jobs that include the 
subvector (s) in their object vector (to any thread), where s = object[j] and j £ [1 ,M], Note that 
in any other case, we define R(s,d) = 0, since no conditional demand events 5(d|s) or conditional 
release events <^(s|s) occur in these cases. 

We prove that the estimation of the request probabilities in Lemma 1 also defines a probability 
(Lemma 2). 


Lemma 2. For R(s,d)’s estimation (Lemma 1), it holds that: (1) T>dR(thread[n\, d) = 1, where 
n £ [1,1V] and d is an object, (2) R(s,s) + E^ s i?(s, d) = 1, and (3) R is a row stochastic matrix. 


Proof. Equation 5 demonstrates claim (1). 


T,dR(thread[n\, d) = 


E d T,jXn,i • startsj((d)) 

E^ A^y 

Y,. l X n ,i'£'dStartSi((d}) 

Ei \n,i 

E,A n ., • 1 


(5) 


Ei Xn i 


= 1 


For any object s ^ object[M ], Equation 6 demonstrates claim (2) due to Equation 7, which holds 
since a job object vector that includes object s, either ends with s or includes more items d ^ s. 

R(s,s) +T,d?sR(s,d) ( 6 ) 

E n,iXn,i ' endsi(^(,s'j^ ^ iXn,i * includesi((,s, d)) 

S„,iA n ,i • includesi((s)) E n ,iA„,i • includesi({s)) 

S„,jA n ,i ■ endsj((s )) + ^d^s^n,iX n ,i ■ includesj{(s, d)) 

E„,iA„,i • includesi((s)) 

Sn,jAn,i ( endsi((s )) + S d^ s includesi((s,d))) 

E n ,jA„,i • includesi((s)) 

S„,jAn,i • includes i({s)) 

S n ,iXn,i ■ includesi((s)) 

= 1 


endsi((s)) + d^tsecludeSi{(s, d)) = include s,;((s)) 


(7) 
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As for claim (3), we note that claims (1) and (2) imply that our estimation of the block matrices 
Rn,m and Rm,m of the matrix R are row stochastic (Equation 1). Since the block matrices Rn,n 
and Rm,n are zero matrices, our estimation of R forms a row stochastic matrix. 


6 Blocking Periods 

We estimate (s, d)’s the blocking period, s.B[k\, using the request probabilities and the job comple¬ 
tion periods, where d = object[k\, k £ [1 ,M] and (s,d) £ E is an edge in Sf . This blocking period 
is an effect of multiple threads’ job paths, i.e., the (*,s,d) and the remaining (*,s, *,d, •) paths. 
The former case corresponds to the job completion period (Lemma 3), whereas the latter depends 
on the delay of acquiring the path remaining objects (Lemma 4). 

Job Completion Periods. In the case of (•, s, d, •) paths, the period for acquiring the remain¬ 
ing objects varies according to (s,d)’s delay, the job completion period depends only on known 
distributions (the job operation times), and the probability that the related events occur. However, 
for the case of (•, s, d) paths, f Si d is the operation time average due to jobs with (»,s,d) paths, 
weighted by the probability for the related events to occur (Lemma 3). This allows us to estimate 
(s,g?)’s blocking time, s.B[k\. Lemma 3 defines f s ,d = £^_i weighti(s,d) ■ 0; as a weighted average 
of all the operation times, where weighty, d) = (EA ■ endsi((s, d)) / W St d, if s is an object, 
weighti(s,d) = Ai >n • endsi((s,d)) / W S} d, if s = thread[n\ and W s ,d = E f =1 weighti(s,d) is the 
sum of all weights. Note that the weights depend on the arrival rates of jobs with object vectors 
that end with ( s,d ), if s is an object and (d), if s is a thread. 

Lemma 3. ( s,d)’s completion period is a weighted average, f S) d = E f =1 weighti(s, d) ■ Oi, of the 
respective job operation times. 

Acquiring the remaining objects. We give an example of how to estimate s.B[k] in a system 
with two objects (Figure 5). For the general case of M objects, we also have to account for (•, s, 

• , r, d, •) paths, as in the proof of Lemma 4. 

Lemma 4. The (s, d) ’s blocking period can be estimated by s.B[k ] = A + R(s, object[k\) ■ R(object[k], 
object[k ]) • f s,object[k] + E£L fc+1 (Pr(p = p(k')) ■ object[k\.D[k'}), where p = (•, s, •, object[k\, •) 
is a path that includes items s and d = object[k], as well as Pr(p = p{k')) is the probability of 
having p{k') = (•, s, •, object[k], object[k'], •) as a path. 

Proof. Let d = object[k\ and d' = object[k']. We define the probability Pr(p = (•, s, •, d, d', •)) = 
R(s, d) ■ R(d , d') + T. k r z\ +1 R{s, object[r ]) • [E {6bjecb\r\, d)] ■ R(d , d') of a path p , where s = 
0, if s = thread[n \, s = j, if s = object[j]. R J (a, /3) is the probability that the path (a, •, /3) includes 
j — 1 intermediate items between a and /?, or equivalently, the probability of an Sf’s path from a to 
j3 to include j edges, which is given by the (a, jd) element of the j -th power of the stochastic matrix 
R. Lemma 4’s proof details s.B[fe]’s forward dependency via its recursive equation that estimates 
by computing s.B[M] blocking periods, before s.B[M — 1] and so on. Moreover, it depends on 
the delays, object[k\.D[k’], for acquiring the remaining job objects. We use backward iterations 
(Section 3) to resolve these dependencies (Section 8 ). 
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7 Item inter-demand periods 

The item inter-demand period allows us to decide on £-OSE’s condition satisfaction. We estimate 
the item inter-demand period, which together with the blocking period, s.B[k] (Section 6), are 
essential for calculating the delay, s.D[k] (Section 8), where d = object[k ], k G [1, M] and (s, d) G E 
is an edge in ^. 

Object inter-demand period. Recall that item d’s inter-demand period, 7d, is the time 
between two demand completions on item d , which in equilibrium equals the time between two 
demand arrivals to d. Therefore, our inter-demand period estimation, 7 ~d, for an object d depends 
on (s,d)’s inter-demand periods, which are the time intervals between two consecutive demands to 
d that first acquire item s, the inter-demand period s.T[k ], and the period during which the threads 
complete such demands. 

The estimation of object ef s inter-demand period, 7 ~d, is the average of s’s inter-demand periods 
plus the time that demands for (»,s,d) paths block d. where (s,d) is an edge in Sf. This time of 
demands is A + R(s,d) • R(d, d) ■ f S: d, weighted by the probability of (•, s, d, •) paths and the 
(s,d)’s inter-demand period, s.T[k\, of demands to d after the supply of item s, due to (»,s,d, •) 
paths (the weights are normalized by their sum). Namely, the average sums the period between two 
demand completions of an item s and the job completion time, and the weights incorporate the item 
dependencies through s.T[fc] for all (•, s, d, •) paths and their probability to occur (d = object[k\). 
By generalizing the example system with M = 2 objects and N threads (Figure 5) for every 
(s, d) G E, we obtain Lemma 5. 

Lemma 5. The inter-demand period of d = object[k] is: Td = (E s earrivals(d) w(s, d) ■ (71 + A + 
R(s,d) ■ R(d 1 d) ■ f s ,d)) / (£ S £arrivais(d) w(s, d)), where arrivals(object[k\) = thread U { object[j] | 
j G [l,fc — 1]} and co(s, object[k ]) = R(s, object[k\) ■ s.T[k ]. 

Lemma 5 details Td's backward dependency via its recursive equation that calculates T 0 b jec t[j ], 
j G [1, fc—1] before calculating T 0 bject\k] ■ Moreover, the calculation of 7thread[n\ and T 0 bject[k\ depends 
on the inter-demand periods, which is thread[n].T[k\, of ( thread[n],object[k\ ) and respectively, 
s.T[k], where k G [1 ,M\ and s G thread U {object\j] \ j G [1 ,k — 1]}. We use forward iterations 
(Section 3) for resolving such dependencies (Section 8). 

Thread inter-demand period. We estimate thread[n\' s inter-demand period, Tthread[n \> by a 
Markov process that considers both the arrival process of jobs to thread[n ] and the time it takes 
thread[n ] to complete a job. 

The way that we estimate 7thread[n\ j uses the augmntThreadBlock(I(n ), blocking(n)) function. 
The function input includes the inter-arrival times I(n) and blocking(n) = A + Ej ^L 1 R(thread[n], 
object[k ]) • thread[n].D[k], i.e., the average time it takes thread[n] to complete a job. Moreover, 
augmntThreadBlock(I(n), blocking(n)) outputs the estimation of 7 thread[n\ an d checks if the OSE 
condition is violated (line 11 in Algorithm 1 of Section 3.3), i.e., the rate Tf =1 A l n that defines I(n) 
(we assume I(n) to be exponentially distributed, as in [7]) is greater or equal than 1 /blocking(n). 
The function analyzes a queue using Queuing Theory [1], In our case, we characterize the queue by 
matching the first three moments of I(n), and respectively, blocking(n) to Coxian-2 distributions 
(Section 4.1). 

Recall that Tthread[n\ denotes the period between two consecutive 6(d\thread[n]) and S(d'\thread[n]) 
events. The system assigns a job pending to a thread’s queue, say thread[n\, whenever that thread 
becomes available, where n G [1, N], After this assignment (and a random acquisition period of A), 
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the event S(d\thread[n]) occurs in which the thread requests to access the job’s first object; denoted 
by d. 

We estimate Tthread[n\ by a Markov chain that depicts both I(n) and blocking(n) (as in Ramesh 
and Perros [18] but with adaptation to shared-object systems). In Figure 6 , ai, 01 and a define the 
Coxian-2 job arrival process at thread[n\ and v\, V 2 and q define the Coxian-2 of the thread[n\ s 
job completion period ( blocking(n )), which we obtain by applying moment matching [ 2 ] on the 
first three moments of I(n) and blocking(n). We denote with Uj the probability that upon an 
arbitrary job completion thread[n] becomes idle and the job arrival process is in phase j £ {1, 2 }, 
as in [18] (Section 4.1). We define T t hread\n\ to follow a phase-type distribution (PH) [ 1 ], which is 
defined using the initial probability vector, c = (iii , 112, 1 — iq — U2, 0 ), and a transition matrix 

in its block form, the ( x,y ) element of S, S{x,y), equals the 

rate in state x times the transition probability to state y, x,y £ {oi, 02 , rq), and S° = —SI. 
The ?n-th moment of T th read[n\ is E [T t ™ read[n] ] = (-l) m m!c5“ m l [ 1 ]. 

8 Resolving Dependencies 

We showed how to estimate the blocking period on an object, s.B[k ], while depending on the delay 
for acquiring other objects, object[k].D[k'] (Section 6 ), as well as how to estimate the inter-demand 
periods Tthread[n\ and Tobject[k]i while depending on the s.T[k] pairwise inter-demand periods, due 
to (•, s, object[k ], •) paths (Section 7). Recall that these variables are inter-dependent due to 
blocking. Theorem 6 demonstrates that we can resolve these interdependencies by representing the 
thread work cycles as a subsystem in a way that is not subject to blocking and yet preserves these 
interdependencies. After the definition of the subsystem, we provide the key steps of the proof by 
looking into the case of M = 3 (Section 8.2). We prove Theorem 6 ’s in Section 8.3 using background 
knowledge [18] (which we refer to in Section 4). 

Theorem 6 . Let s £ Sk and d = object[k], where Sk £ {thread} U {{ object[j ]} | j £ [l,fc)}. 
The Baynat-Dallery framework can approximate the delay s.D[k] and pairwise inter-demand period 
s.T[k] through the contention subsystem CS(Sk, k) = {Tl{Sk, k), (7 Z s ) s ^s k , {Bs)ses k ), where Sk 6 
{thread} U Sk and Sk = {{object[i\} \ i £ [1, k — 1]}. The running time of each framework iteration 
is 0(M ■ N 4 ). 

8.1 Contention subsystems 

Given any pair of system items, s and d = object[k ], we calculate the (pairwise) state c[s, d] for i^’s 
edge (s, d) using a construction that we name contention subsystem CS(Sk,k), where k £ [1,M], 


Q (Figure 6 ), where Q = 
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s £ Sk and Sk is either the set of all threads, Sk = thread, representing [t.hread[n], d, •) job paths, or 
a set including a single object, Sk = {object[j}}, j £ [1, k — 1], representing (•, object[j], d, •) paths. 
For every item s £ Sk, we use the thread work cycle, cycle{thread[n], jobj) of thread[n], n £ [1, N], 
carrying out jobj , (Section 2), to show that the contention subsystem CS(Sk, k) represents the state 
of the shared-object system, with respect to the interdependencies among the delay s.D[k] and the 
pairwise inter-demand period s.T[k] when the blocking time is known. 

Let object[k] £ {object[ 1], ..., object[M ]} as well as Sk = {{object[ 1]}, ..., {object[k— 1]}} (i.e., 
Sk = 0 when k = 1) and Sk £ {thread} U Sk, where thread = {thread{ 1], ..., thread[N]}. Moreover, 
let Rel{thread , k) = [1, M] \ {fc} and Rel{{object[i ]}, k) = [i + 1 ,M\ \ {fc}, where object[i\ £ Sk- 
We partition the (•, s, •) paths to three sets, V(s,d) = where V\ = {path \ path = (•, s, 

d, •)}, V 2 = {path\path = (•, s, object[i], •, d, •) A i £ Rel{Sk, k) \ [k+1, M]}, V 3 = {path\path 
= (•, s, *) A d path} and Rel(t,hread , k) = [1, M] \ {fc}. 

A contention subsystem, is denoted by CS{Sk, k) = ( H(Sk, k), (lZ s ) s ^s k , (Bs)ses k ) and defined 
by (l)-(3). 

(1) The contention graph R{Sk, k) = (V,£) has the set of vertices V = U se s fe V s , and the set of 
edges £ = U se s fc £ s , such that for every s £ Sk, V s = {s} U {d} U {relay(s,j) \ j £ Rel{Sk, k)} and 
£ s = £} Uf s 2 U £l, where £] = {(s, d), ( d, s)}, £ 2 S = U ieJJei(Sfcife) \ [fc+1>M] {(s, relay{s, j)), ( relay{s , 
j), d), {d, s)}, and £% = U jeR e i(s k ,k){(s, relay{s, j)), ( relay{s, j), s)}, i.e., £!} corresponds to the 
partition Vk, where k £ [1,3]. Note that H{Sk, k ) is a simple graph, i.e., there are no multiple edges 
between two vertices. Moreover, relay(s,j) is a distinct copy of a relay object[j}, j £ Rel{Sk,l), for 
S £ Sk- 

(2) The request probability matrices 1Z S for R(Sk, k ) = (V, £) and s £ Sk- 7 Z s (s, d) depicts the 
probability of a path (•, s, d, •). Moreover, lZ s {s, relay(s, j)) depicts the probability of a path r 
= (•, s, object[j ], •), while lZ s (relay(s, j), d) depicts the probability that r = (•, s, object[j ], •, 
d, •). Furthermore, 1Z s {v, s), v £ V s \ {s}, depicts the probability of a thread becoming idle or 
starting a new job after the completion of a job, which is a certain event and therefore TZ s (v, s) = 1, 
v £V S \ {s}. 

(3) The blocking periods, (£> s ) se s fc , where B s is a function over the set of items in V s , and s £ Sk 
refers to the thread blocking periods on each of V s ’s items. Note that E l s forms a directed circle 
in 77(S'fe,/c), where s £ Sk and i £ [1,3]. A demand request to d that follows the supply of s £ Sk 
and possibly the supply of a relay object, object[j], j £ Rel{Sk,k) \ [k + 1 ,M], is blocking d for 
B s {d) = s.B[k] time. A demand request to server relay(s,j) that follows the supply of s £ Sk, 
blocks that server for a period of B s (relay(s, j)) = s.D\j], if j £ [k + 1 ,M] and B s (relay(s, j)) = 
W(s,relay(s,j), d), if j £ Rel(Sk, k) \ [k + 1, M], where W(s , relay{s,j), d) equals the delay s.D[j] 
minus the blocking period of a possibly subsequent demand event to d = object[k\. Once a job is 
completed, another demand event follows the supply of s £ V s after a period of B s (s) = T s . 

8.2 The case of systems with M — 3 objects 

We use an illustrative example that shows how the contention subsystem of Sk = thread and 
d = object[ 2] represents the dependencies among the threads of a system with M = 3 objects 
and N threads, with respect to the delays and pairwise inter-demand periods when the blocking 
times and the item inter-demand periods are known. We construct the contention subsystem 
CS(thread, 2) = (' H(thread, 2), (TZ s )sethread, (B s ) S £thread) based on the work cycles related to 
the delay thread[n\.D[ 2] and inter-demand period thread[n\.T[ 2], for every s = thread[n\ £ thread 
and d = object[ 2]. We explain the representation of work cycles by a contention graph, which is 


19 





5i(thread[n\, object[ 2]) 


5i(thread[n\, object[l\) 


(a) Contention graph for 

M = 3 (b) The case of Vi 


' Si(thread[n], object[2]) 

(object[2\, thread[n\) \ / <Ji{object[2),thread[n)) 

<j>i(object [2]{threa5{n\) 

(c) The case of V 2 



(d) The case of V 3 


Figure 7: The contention graph for CS{thread, 2) and the work cycles partitions, V{s,d ) = 
u fe[ i,3 ]7V of (s, •) paths, where Vi = {x \ X = (s, d, •)}, V 2 = {x \ X = (s, object[ 1 ], d, 
•)}, 7^3 = {x | X = (s, •) A d £ x}i s = thread[n] and d = object[ 2]. 


illustrated in Figure 7a, and the adaptation of the request probabilities and blocking times to the 
ones of the contention subsystem. The challenge here is to demonstrate that a dynamic system 
that is based on correlated events with dependencies that are due to blocking and follow non- 
deterministic schedules can be represented by these subsystems. After demonstrating this part 
of the proof, the rest of the proof follows by matching between the subsystems presented here to 
the one by Ramesh-Perros [18], which use a framework proposed by Baynat and Dallery [4] for 
estimating our system’s state. 

Contention graph of CS{thread, 2). Let TL{thread, 2) = (V,£) for Sk = thread and d = 
object[2]. (Figure 7a). Given an arbitrary thread, s = thread[n ], n £ [1,1V], let V{s, d) = 
U /e[ i,3]P/ be a partition of (s, •) paths, where V\ = {path \ path = {s, d, •)}, V 2 = {path \ path 
= ( s , object{ 1], d, •)} and V 3 = {path \ path = (s, •) A d path}. Note that for brevity, we 
write s = thread[n], 01 = object[l], 03 = object[ 3], and d = object[ 2] throughout this example, 
but we clarify this notation when needed. Moreover, let V = U rte n /yi V s be the union of V s = {s, 
relayd , relay{s, 3)}, where relay(s, j), j £ {1, 3}, are s’s distinct copies of a relay object, 
objectfj], which allow us to distinguish paths with respect to threads. The contention graph’s 
nodes s, relay(s,j) and d = object[k\ represent s, object[j] and, respectively d, in the shared-object 
system, where j £ {1,3}. 

The edges £ = U ne n :v]£.s follow the path partition cases, 31. Let job,, be a job that 

s = thread[n] carries out. The edge sets £}, where £ £ [1,3], are defined as follows. 

• Vi’s case refers to work cycles, for which s demands access to d, once it is assigned with jobi. 
When s gains access to d, jobi might require s to demand access to 03. Upon jobi’s completion 
s releases any acquired object. Thus, the edges in £\ = {(s, d), (d, s)} (Figure 7a) represent the 
work cycle subvectors (<5,:(s, d)), and respectively, (cr,;(d, s), ..., </>i(d, s), ...) (Figure 7b). 

• Vi’s case refers to work cycles, for which s’s demand for access to 01 is followed by s ’s demand 
for access to d after 01’s supply, which is then followed by jobi’s completion. Thus, the edges in the 
set £2 = {(s, relay{s, 1), ( relay{s , 1), d), (d, s)} (Figure 7a) represent the work cycle subvectors 
(Si(s, o 1)), (cTj(oi, s), 5i{s, d)), and respectively, {ai{d, s ), ..., ^{d, s), ...) (Figure 7c). 

• TVs case refers to work cycles, for which s demands access to object[j] and then completes 
jobi, where j £ {1,3}. Therefore, the edges in £3 = {(s,relay{s, j)), (relay(s, j), s)} (Figure 7a), 
represent the subvector (<5* (s, object\j])), and respectively, (<jj {object [j]), s), ..., <f>i{object\j], s), 
...) of the work cycle (Figure 7d). 

CS {thread, 2)’s blocking times and request probabilities. We complete the example in 
which we show how the contention subsystem represents the dependencies among the threads in 
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the shared-object system. We refer to an arbitrary job, say job, , that s = thread[n\ carries out 
and explain how the contention subsystem’s request probabilities (7?-s) n e[i,JV] and blocking periods 
(£> s ) nS [ i ; jv] represent the request probabilities, and respectively, the blocking periods in the shared- 
object system. We justify this representation using the work cycle of jobi, when its path is in the 
path partition Vj, for every j £ [1,3]. Note that for each such path partition, the period between two 
consecutive work cycles completed by s is represented by B s (s) = T s (Figure 7). Namely, if s carries 
out jobi and consecutively job ,/, and their first demand requests are Si(s, d), and respectively, <5*/(s, 
d!), the blocking period B s (s) represents the period between these two events (in the contention 
subsystem). 

The case of (s, cq,*) paths. Consider the case where s carries out jobi with path r = (s, Oi, •), 
where r G V 2 , if d is included in r and r £ V 3 , if d is not included in r. We present the request 
probabilities among s, relay(s, 1 ) and d , as well as the blocking times on each of these items in the 
contention subsystem. 

• The probability 7 Z s (s, relay(s, 1)). The probability lZ s (s, relay(s, 1)) = R(s , 01 ) (by the 
definition of R) denotes the contention subsystem event of s demanding access to relay(s, 1), which 
represents s demanding access to cq immediately after jobi’s assignment in the shared-object system 
(figures 7c and 7d, when j = 1). 

• The blocking period B s (relay(s, 1)). Let W(s, cq, d) denote the period during which s blocks 
cq, minus the possible blocking period on d (after the supply of access to cq) in the shared-object 
system. Moreover, let X(s, cq, d) = Pr[(s, cq, d)] • cq..B[2] denote the (possible) blocking period of 
s = thread[n\ to d = object[2 ], after gaining access to cq = object[ 1] in the shared-object system, 
where the probability Pr[(s, cq, d)] = R(s, cq) • i?(cq, d) denotes the event of s demanding access 
to cq and successively to d. Namely, A(s, cq, d) is the time between the work cycle events Si(s, d) 
and </>i(d, s) times the probability of s to demand access to d after gaining access to cq (Figure 7c). 
Thus, in the shared-object system W(s, cq, d) = s..D[l] — X(s, cq, d). Therefore, B s (relay(s , 1)) 
= W(s, cq, d) represents the period during which s blocks relay(s, 1) before possibly demanding 
access to d in the contention subsystem. 

• The probabilities lZ s (relay(s, 1), d) and lZ s (relay(s, 1), s). Let .F( S| 1 ) 2 ) and IF', 1 ^ denote 
the events in which s demands access to d after gaining access to cq in the shared-object system, 
and respectively, to relay(s, 1) in the contention subsystem. The event F' <s 3 2 , in the contention 
subsystem represents the event F'^ s 1 in the shared-object system, and therefore, the probability 
of lF[ s 1 2 ^ is given by lZ s (relay(s, 1), d) = Pr[(s, cq, d)] (Figure 7c). Moreover, when the event 
■F[s 1 2 ) (and thus the event jF( s ,i, 2 )) does not occur (Figure 7d), the respective job is completed and 
s becomes idle or starts a new job with probability lZ s (relay(s, 1), s) = 1 — lZ s (relay(s, 1), d). 
The case of (s, d, •) paths. Consider the case where jobjs path is (s, d, •) £V\. In Figure 7b, s 
demands access to d immediately after it is assigned with jobi. This is represented in CS(t,hread , 
2) by s demanding access to d, with probability 7 Z s (s, d) = R(s, d). Moreover, s blocks d for a 
period of B s {d) = s.B[2] in CS(thread , 2), which represents the period between the events cq(d, s ) 
and (j>i(d , s) in figures 7b and 7c. After the job completion and the release event of d in CS (thread, 
2), s enters, with probability lZ s (d, s) = 1, an idle period (of possibly zero length) until it starts 
carrying out a new job. 

The case of (s, 03 ) paths. Consider the case where s carries out jobi with path r = ( 5 , 03 ) £ V 3 . 
In CS(thread , 2), s demands access to relay(s, 3), with probability !Z s (s, relay(s, 3)) = R(s, 03 ), 
which represents s demanding access to 03 immediately after j obi's assignment in the shared-object 
system (Figure 7d). The blocking period of s on relay(s , 3) is B s (relay(s, 3)) = s.D[3], which 
in the shared-object system represents the time that s is waiting to gain access to 03 and then 
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(a) The H(Sk, k) contention graph, where [ii,i c ] = Rel(Sk, k) \ [k + 1, M] 




{thread[n\, object[ij]) 

<Ji(object[ij], thread[n]) 

; Si(thread[n\,object[k]) 
/ ai(object[k\,thread[n\) 


$i(thread[n\ j 


, 5i(thread[n], object[ij\) 


/ ai (objecb [ij], thread[n ]) 


<&i(thread[n\) 


(b) Case 1 work cycles. 


(c) Case 2 work cycles. 


(d) Case 3 work cycles. 


Figure 8: The contention graph for CS(Sk,k) and the work cycles partitions, P(s,d) = U^ g r i^Ve, 
of (•, s, •) paths, where V\ = {path \ path = (•, s, d , •)}, V 2 = {path \ path = (•, s, object[i ], •, 
d, •) A i £ Rel(Sk, k) \ [k + 1,M]}, V 3 = {path \ path = (•, s, •) A d £ path}, Rel(t,hread, k) = 
[1, M] \ {/c} and d = object[k\. 


blocking it, i.e., the period between the work cycle events <5j(s, 03 ) and <^( 03 , s) (Figure 7d). After 
the job completion and the release event of relay(s, 3) in CS{thread, 2), s enters, with probability 
lZ s (relay(s, 3), s) = 1, an idle period (of possibly zero length) until it starts carrying out a new 
job. 

The contention subsystem CS(thread , d) = ( R(thread , d), ( TZ s )s£thread , ( t3 s )sethread) 1 which 
we described above, represents the dependencies among the thread set, thread , and d = object[2] 
in the shared-object system. 

8.3 The case of systems with M objects 

In this section we prove that Theorem 6 follows from lemmata 7, 8 and 9 (Corollary 10). 

Lemma 7. Consider a contention subsystem CS(Sk, k ) = ( H(Sk, k), (7 Z s ) S £S k > C Bs)ses k ), where 
Sk € {thread}Usk and Sk = {{object[i}} \ i G [1, k — 1]}. Suppose that we are given the shared-object 
system’s blocking and item inter-demand periods, as well as the request probabilities R. It holds 
that CS{Sk, k ) represents the dependencies among the threads in the shared-object system and the 
system’s state. 

Proof. We show a mapping of the shared-object system’s state to the contention subsystem CS(Sk, k). 
Given the shared-object system’s blocking and item inter-demand periods, as well as the request 
probabilities, we construct the contention subsystem CS(Sk, k) = ('H(Sk, k), (lZ s ) S £s k , (®s)ses k ) 
based on the work cycles related to jobs with (#,s,*) paths, where s € Sk and Sk = {{object[i}} 
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| i G [1, At — 1]}- We explain the representation of work cycles by the contention graph H(Sk, 
k), as well as the representation of the shared-object system’s request probabilities and state, i.e., 
blocking, pairwise inter-demand period and delay, by (T Z s ) s eS k , and respectively, ( B s ) s eS k in the 
contention subsystem. This construction is the mapping that proves the lemma’s statement. 

The proof is organized as follows. In the first part, we construct the contention graph R(Sk, 
k) = (V, £) using the work cycles related to (»,s, •) paths. Moreover, in the second part we show 
that (lZ s ) S £s k and ( B s ) se s k represent the dependencies and the shared-object system’s state with 
respect to (•, s, •) paths. In our construction, we assume the knowledge of the item inter-demand 
periods T v , the system’s state delay and blocking periods, as well as the request probabilities R. 
The graph %(Sk,k) represents the work cycles related to the (•,«,•) paths. Let 
H(Sk,k) = (V, £) be the contention graph of CS(Sk, k) (Figure 8a) and consider s to be an 
arbitrary element of S}. Recall that Rel(thread , k) = [1, M\ \ {fc} and Rel ({ object[£}}, k) = 
[£ + 1 , M] \ {At}, where l G [1, At). Moreover, let V(s, object[k ]) = L>ee[i, 3 ]'Pe be a path partition of 
(•, s, •) paths, where Vi = {path \ path = (•, s, object[k], •)}, V 2 = {path \ path = (•, s, object[£\, 
• , object[k \, •) A £ G Rel(Sk, k) \ [k + 1,M]} and V 3 = {path \ path = (•, s, •) A object[k\ £ 
path}. We explain the representation of the thread work cycles for jobs with (•, s, •) paths by the 
graph n(S k , k). 

We define the elements of V and £. Let V = U se s fe V s , where V s = {s, object[k}} U {relay(s,j) 

| j G Rel(Sk,k)}. The nodes relay(s, j), j G Rel(Sk, k), are s’s distinct copies of a relay object, 
object[j ], which allow us to distinguish paths with respect to threads. The edges £ = U s eS k £s follow 
the three path partition cases, i.e., £ s = U^n ^\£i, where £\ = {(s, object[k}), ( object[k ], s)}, £2 
= {(s, relay(s, j), ( relay(s , j), object[k\), ( object{k ], s)} and £3 = {(s, relay{s , j)), ( relay(s , j), 
s)}. Let jobi be a job that thread[n] carries out, such that item s is either included in job £s object 
vector or s = thread[n ], and let cycle(thread[n\, jobi) be the respective work cycle. The edges in 
the sets £g, where £ G [1,3], represent the events in every possible cycle(thread[n], jobi) after the 
supply of item s, if s is an object, or after the assignment of jobi to s, if s = thread[n\. For brevity, 
we refer to both events as the supply of item s. The edge sets are defined according to the three 
sets of the path partition V(s , object[k ]) = as follows. 

• "Pi’s case refers to work cycles, for which thread[n] demands access to object[k ], immediately 
after the supply of item s. When thread[n] gains access to object[k ], jobi might require 
thread[n\ to demand access to another object, object[k'], where k' G [k + 1, M], Upon jobj s 
completion thread[n\ releases any acquired object (event <&i(thread[n\) of the work cycle). 
Thus, the edges ( s , object[k ]) and ( object[k \, s) of £\ (Figure 8a) represent the subvec¬ 
tor (Si(thread[n], object[k ])), and respectively, the subvector (ai(object[k], thread[n ]), ..., 
&i(thread{n})) of the work cycle (Figure 8b). 

• TVs case refers to work cycles for which thread[n\, after the supply of item s, demands access 
to object[j ], where j G Rel(Sk, k)\ [fc +1, M], and subsequently to object[k]. Note that, by the 
definition of TV thread[n ] might also demand access to other objects in ( object[j ], object[k\) U 
[object[k+ 1], object[M. Thus, the edges (s, relay(s, j), ( relay(s , j), object[k ]), ( objectjk ], s) 
of £2 (Figure 8a) represent the work cycle subvectors ( 6 i(thread[n ], object[j})), ( <Ji(object[j ], 
thread[n ]), ... 6 i(thread[n \, object[k])), and respectively, (ai (object[k\, thread[n ]), ..., 
<&i(thread[n ])) (Figure 8c). 

• V 3 S case refers to work cycles, for which thread[n\, after the supply of item s, demands 
access to object[j ], where j G Rel(Sk,k). Note that, by the definition of TV thread[n ] 


23 



might also demand access to other objects in ( object[j] , object[M , but not to object[k\. 
Thus, the edges (s, relay(s, j)) and ( relay(s , j), s) of £3 (Figure 8a), represent the subvec¬ 
tors ( 5i(thread[n ], object[j ])), and respectively, (ai(object\j]), thread[n\), ..., 3>i(thread[n])), 
where j £ Rel(Sk, k) of the work cycle (Figure 8d). 

(Jl s )seSk and (^s)seSfc represent the dependencies and the shared-object system’s state 
with respect to (•, s, •) paths. We refer to an arbitrary job, say jobi, that thread[n] carries out 
and explain how the contention subsystem’s request probabilities ( lZ s ) se s k and blocking periods 
(B s ) S £s k represent the request probabilities, and respectively, the blocking periods in the shared- 
object system. We verify this representation using the work cycle of jobi , when its path is in the 
path partition Vj, for every j £ [1,3]. We remind that throughout this proof we refer to the supply 
of item s as the supply of access to object[£\ for thread[n\ , if s = object[£\, £ £ [l,k — 1], or the 
assignment of jobi to thread[n], if s = thread[n\. 

Note that for each such path partition, the period between two consecutive work cycles for jobs 
that include item s is represented by B s (s) = T s (Figure 7). Namely, the blocking period B s (s) 
in the contention subsystem represents the period between the event of thread[n] releasing item 
s due to jobi , and consecutively, another thread, say thread[n'], releasing item s due to a job, 
say jobi>, in the shared-object system. We show this representation by looking into three cases of 
path partitions, i.e., (1) paths (•, s, object [fe], •) in 'Pi, (2) paths r = (•, s, object[j], •), where 
j £ Rel(Sk,k) \ [k + 1 ,M\ (r £ V 2 , if object[k] is included in r and r £ V 3 , if object\k ] is not 
included in r) and (3) paths r = ( thread[n ], •, s, object[j], •), where j £ Rel(Sk,k), and object[k\ 
£ r, i.e., r e V 3 - 

(1) Consider the case where jobj s path is in partition Vi, i.e., thread[n] carries out jobi with 
path (•, s, object[k\, •). In Figure 8b, thread[n} demands access to object[k] immediately 
after the supply of item s. This event is represented in the contention subsystem by thread[n\ 
demanding access to object[k\, with probability lZ s (s, object[k ]) = R(s, object[k ]) / K{Sk , k) 
(by the definition of R), where K(Sk, k) = E v eRei(s k ,k)u{k} R{ s > object[v\) is a normalizing 
constant . Furthermore, thread[n\ blocks object[k\ for a period of B s {object[k ]) = s.B[k\ in the 
contention subsystem, which represents the period between the events er i(object\k\, thread[n ]) 
and $i(thread[n ]) in the work cycles presented in figures 8b and 8c. This is due to the fact that 
s.B[k] also considers requests for accessing object[k\ while following either job path ( thread[n \, 
• , s, object[k ], •) or (■ thread[n\ , s, •, object[k ], •) (Lemma 4). After the job completion and 
the release event of object[k], another supply event of item s occurs or it becomes idle. Namely, 
if s = thread[n ], s will either start carrying out the next pending job or it will become idle, 
and if s = object[£\, the thread on s’ s queue top will gain access to s in case the queue is not 
empty, otherwise s will become idle. These are certain events in the shared-object system and 
therefore occur with probability 1Z S (object [fc], s) = 1 in the contention subsystem. 

(2) Consider the case where thread[n] carries out jobi with path r = (•, s, object[j ], •), where 
j £ Rel(Sk,k) \ [k + 1,M\. Note that r £ V 2 , if object[k\ is included in r and r £ P 3 , if 
object[k] is not included in r. We present the request probabilities among s, relay(s, j) and 
object[k ], j £ Rel(Sk,k) \ [k + 1,M], as well as the blocking times on each of these items in 
the contention subsystem. We present (i) the probability 7 Z 8 (s, relay(s, j)), (ii) the blocking 
period B s {relay{s, j)) and (iii) the probabilities lZ s (relay{s, j), object[k ]) and 7 Z s (relay(s, j), 
s). 

(i) The probability 7 Z s (s, relay(s, j)) = R(s, object[j ]) / K(Sk, k) (by the definition of 
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R) denotes the contention subsystem event of thread[n\ demanding access to relay(s, 
j) after the supply of item s, which represents thread[n] demanding access to object[j] 
immediately after the supply of item s in the shared-object system (figures 8c and 8d, 
when j € Rel(Sk, k) \ [k + 1, M]). 

(ii) Consider the case where thread[n] might demand access to objects in ( object[j ], object[M, 
after the supply of object[j] in the shared-object system. Let W(s , object[j ], object[k\) 
as the period during which thread[n] blocks object[j} minus threadings possible blocking 
period on object[k] in the shared-object system, due to a path in "P 2 - We refer to X(s, 
object[j ], object[k ]) = SPr[(s, object[j ], •, object[£], object[k])\ ■ object[£\.B[k] as the 
possible blocking period of thread[n] to object[k], after gaining access to object[j] in the 
shared-object system. Note that the probability Pr[(s, object[j], • , object[£], object[k])] = 
R(s, object[j ]) • ['EiZfi R k {object [£] 1 object[j ])] • R(object[£], object[k ]) denotes the event in 
which thread[n], after the supply of item s, demands access to object[j}. possibly to other 
objects in ( object[j ], object[£]), to object{£] and successively to object[k\. Namely, let X(s, 
object[j ], object[k ]) denote the time between the work cycle events 6 i(thread[n], object[k ]) 
and <fri(thread[n\) times the probability of thread[n\ to demand access to object[j ], after 
gaining access to s, and subsequently to object[k\ (Figure 8c). Thus, in the shared-object 
system IF(s, object[j], object[k\) — s.D[j\ — X(s, object[j], object[k ]). Therefore, the 
period during which thread[n\ blocks relay(s, j), after the supply of item s, and before 
possibly demanding access to object[k\ in the contention subsystem is represented by 
B s (relay(s, j)) = W(s, object\j ], object[k}). 

(iii) Let £F^s,j,k) and j k ^ denote the event in which thread[n ], demands access to object[j ], 

after the supply of item s, and subsequently to object[k\ in the shared-object system, and 
respectively, the event in which thread[n\ , after the supply of item s, demands access to 
relay(s , j) and consecutively to object[k] in the contention subsystem. Note that thread[n\ 
might demand access to other objects in ( object[j ], object[k ]) in the shared-object system. 
The event contention subsystem F[ s ■ represents the event ■ k ^ in the shared-object 

system, and therefore, the probability of J-^ s ■ k ^ is given by lZ s (relay(s, j), object[k ]) 
which equals to Pr[(s, object[j ], object[k])} (Figure 8c). Moreover, when the event j k ^ 
(and thus the event J r ( s ,j.k)) does not occur (Figure 8d), the respective job is completed 
and thread[n] becomes idle or starts a new job with probability lZ s (relay(s, j), s) = 1 — 
lZ s (relay(s, j), object[k}). 

(3) Consider the case where thread[n\ carries out jobi with path r = ( thread[n ], •, s, object[j ], •), 
where j € Rel(Sk, k), and object[k\ ^ r, i.e., r G V 3 . In the contention subsystem, thread[n], 
after the supply of item s, demands access to relay(s, j), with probability lZ s (s, relay(s, j)) = 
R(s, object[j ]) (by the definition of R), which represents thread[n\ demanding access to object[j] 
immediately after the supply of item s in the shared-object system (Figure 8d). The blocking 
period of thread[n] on relay(s, j) is B s (relay(s, j)) = s.D[j ], which in the shared-object system 
represents the time that thread[n\ is waiting to gain access to object[j ], blocks it while it that 
does read well here. Too many ’it’. Please use the name thread[n\ and object[j ]. Another way 
to go is to use (conditional) events... the time between 6 i }n (d\s) and .... possibly demands 
access to other objects (except for object[k ]) and releases all of its acquired objects. Namely, 
B s (relay(s , j)) represents the period between the work cycle events Si(thread[n\ , object[j]) and 
&i(thread[n}) : after the supply of item s (Figure 8d). After the job completion and the release 
event of relay(s, j) in the contention subsystem, either another thread gains access to item s 
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or item s becomes idle (no thread is accessing it), i.e., this event is certain to happen in the 
shared-object system. 

The CS(S k , k) = ( H(S k , k), (H s ) seSk , (B b ) S es k ) contention subsystem, which we described 
above, represents the dependencies among the item set, S k , and object [fc] in the shared-object 
system. Therefore, the proof is complete. 

Lemma 8. The framework of Baynat and Dallery can approximate the delay s.D[k\ and inter¬ 
demand period s.T[k] through the contention subsystem CS(S k ,k). 

Proof. We first give the definition of a Ramesh-Perros subsystem (RPS) that is introduced in [18] 
and show that a contention subsystem can be directly mapped to an RPS. Rainesh and Perros 
show that we can find the pairwise inter-demand period, and delay of (blocking) communications 
in an RPS using a framework proposed by Baynat and Dallery in [4]. Thus, we can use the Baynat- 
Dallery framework to estimate the values of s.T[k] and s.D[fc], for every s G S k , given a contention 
subsystem CS(S k ,k) as an input. In Appendix 4.2, we present an adapted version of the Baynat- 
Dallery framework (Algorithm 2) and explain the calculation of the delay s.D[k\ and inter-demand 
period s.T[k], given the contention subsystem CS(Sk , k). 

A Ramesh-Perros subsystem is defined as follows. Let {o[l], ..., o[M]} be a set of servers, such 
that tier -k includes only server o[fc], where k € [1 , M], and {t[l], ..., t[N}} be a set of clients. 
Moreover, let s k = {{o[l]},..., {o[k — 1]}} (i.e., s k = 0 when k = 1) and S k G {{£[1], ..., f[.A]}} 
U s k . A Ramesh-Perros subsystem, is denoted by 1ZV(S k , k) = (H(Sfc, k), (R x ) x es fc > and 

defined as follows: 

(1) The RPS graph tt(S k , k) = (V, E) has the set of vertices V = U x€ s fc V x , and the set of edges E = 
Uses* E x , such that V x = {a:} U {o[fc]} U {relay(x,j) \ j G Rel(S fe , k)} and E x = E] U E^ U E|, 
where Rel({f[l], ..., i[JV]}, k) = [1 ,M\ \ {fc}, Rel({o[i]}, k) = [i + 1 ,M] \ {fc}, where o[i\ G s k , 

= {(®, »[£;]), (o[k], x)}, Ej. = U ieRel(5fcife) \ [fc+ljM ]{(a:, relay{x, j)), ( relay(x,j ), o[k)), (. o[k ], 
a-)}, and E^ = U_ 7 - eRe i(s fc) fe){(a;, relay(x, j)), ( relay(x , j ), x)}. Note that H(S' fc , k) is a simple 
directed graph, i.e., there are no multiple edges between two vertices. 

(2) The RPS request probability matrices R x for H (S k , k) = (V, E) and x G S k , where, E. x [v k ,ve\ is 
the probability that the process at vertex v u G V x forwards the client request to the server at 
vertex vi G V x , for an edge (v u , vg) in E x . 

(3) The RPS blocking periods , (B x ) xe s fc , where B x is a function over V x , and x G S k refers to client 
or server processes. Note that the edges in E(. form a directed circle in the graph H (S k , k), where 
x G S k and i G [1,3]. A client request of x G V x to server o[k], is blocking o[k\ for B x (o[fc]) = 
x.B[fc] time. A client request of x G V x to server relay(x, j), blocks that server for a period of 
B X {relay{x, j)) = x.D[j], if j G [k + l,M] and B X (relay(x, j)) = W(x, relay(x, j), o[k ]) otherwise 
(j G Rel(5fc, k)\[k + 1, M\). Once the servers return to process x G V x with an answer, after 
a period of B x (a:) = T x , process x sends a new client request to a server in V x \ {x}. Note that 
x.B[fc], £.D[j], W(x, relay(x,j ), o[k\) and T x are functions of x and k, x and j, x, relay(x,j ) and 
o[k], and respectively, x. 

Note that this definition of an RPS is a special case of the definition given in [18], which 
is adapted to our purposes. A contention subsystem CS(S kl k) is directly mapped to an RPS 
TTP(S k ,k), by setting o[k\ = object[k\, t[n] = thread[n], a message to be a demand request, H (S k , 
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k) = H(S k ,k), (R x )xes k = (7 Z s )seS k and (B x ) xeSk = (£ s ) se s fc , where k £ [1 ,M] and n £ [1,7V]. 
Namely, we set ;r.B[fc] = s.B[k], x.T)\j] = s.D[j], W(x, relay(x,j), o[k ]) = W(x, relay(s,j), object[k ]) 
and l x = T Sl where x £ Sk equals to the respective s £ Sk following the mapping that we described 
above. 

Lemma 9. The running time of each framework iteration is 0(M ■ TV 4 ). 

Proof. The running time per iteration of the Baynat and Dallery algorithm [4] is in 0(\stations\- 
|dasses| 3 ), where \stations\ and |cTasses| are the numbers of the network’s stations and classes, re¬ 
spectively. In the context of shared-object systems, the number of stations , |stations |, corresponds 
to the number of vertices |Vj of the contention graph r H(Sk,k) = (V, £) and the number of classes 
to Sk s cardinality, where k £ [1 ,M\. Thus, the running time of each iteration is in 0(|Vj • |*Sfc| 3 ). 
Notice that |V| = TV + TV • (M — 1) + 1 and \Sk\ = TV, if Sk = thread , and |V| = M — t + 1 and 
|Sfc| = 1, if Sk = {object[£]} : where £ £ [1 ,k— 1]. The result follows by taking the maximum |V| 
and l^l of these two cases. 

Corollary 10. Lemmata 7, 8 and 9 imply Theorem 6. 

9 Finding an s-OSE 

We present a procedure (Algorithm 3) for finding e-OSEs. We give a detailed explanation of 
Algorithm 3 (Section 9.1) and analyze its running time (Lemma 11 of Section 9.5). We also detail the 
algorithm’s functions (sections 9.2, 9.3, and respectively, 9.4), which are initializeSystemState (), 
updateStates (), and recalcB() and recalcTQ. 

9.1 The e-OSE solver 

The procedure always halts and computes an approximated equilibrium, e-OSE, when such is 
reachable. Namely, whenever the job arrival and completion rates become equal, the procedure 
returns the system state in an e-OSE, or indicates that an OSE is not a state that the system can 
be in. The procedure sets initial values to the system state, c[s, d\, and then uses the proposed 
methods (sections 5 to 8) for estimating c[s, d] iteratively, until convergence. The decision on when 
to stop considers the system inter-demand period, {Titem}itemev\{object[M]}i an d stops whenever 
there is no item £ V\{object[M ]} for which the change in 7 Item is greater than e since the previous 
iteration, where ( S = (V. E ) in the acquisition graph. 

The procedure’s input includes the system parameters, i.e., number of threads TV and objects 
M, the jobs, jobi , i £ [1, J], and their arrival rates {Ai jn }j g h j] >nS [i : jv] to each thread[n ], n £ [1, TV], 
as well as the request probability matrix, R. The procedure’s output includes the delay D, inter¬ 
demand period T and blocking period B between all system items, as well as, the item inter-demand 
periods T v , for every item v ^ object[M], 

The procedure starts with a system state that represents the case in which all queues are empty 
(see the function initializeSystemState ()). It then estimates the state of a system in which threads 
can block one another, and the delay grows as more requests are pending in the queues. The main 
part of the pseudocode (Algorithm 3) consists of a repeat-until loop (lines 8-13) that follows the 
procedure’s initialization (line 6). Before the procedure can return its output value, the loop has 
to end either when the e-OSE conditions are satisfied or when the procedure detects that an OSE 
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cannot be reached. Each iteration aims at further improving the e-OSE estimation. The repeat- 
until loop then exits when no item changes by at least e between every two iterations (and thus the 
system state satisfies the conditions of an approximated equilibria). 

In every iteration, the procedure computes (i) the blocking periods of demand requests for ob¬ 
jects, s.B[k] (the function updateStates ()), (ii) the thread inter-demand periods, Tthread[n] (the 
function augmntThreadBlock( )), and (iii) the object inter-demands, Tobject\j] (which is the func¬ 
tion updateStates()), where (s, object[k ]) is an edge of the acquisition graph Sf, n £ [1, iV] and 
j £ [1, M — 1]. The repeat-until loop repeats the steps (i), (ii) and (iii), which deals with inter¬ 
dependencies using alternating backward and forward iterations. Namely, it resolves the forward 
dependencies in which (s, object[k ])'s blocking period, s.B[k], depends on object[£]’s delay by it¬ 
erating backward, where £ £ (k,M\, i.e., starting from k = M and counting downwards, we can 
estimate s.B[k], because (in a system that its state satisfies the equilibrium conditions) all of 
(s, object[k])'s forward dependencies can be resolved. Similarly, its uses forward iterations for re¬ 
solving backward dependencies with respect to d 's item inter-demand period, T 0 bject[k 1 7 because all 
of object[k \s backward dependencies are resolved. Moreover, the function augmntThreadBlock {) 
allow the repeat-until loop to stop whenever the job inter-arrival time becomes less or equal than 
the time it takes that thread to complete such jobs, i.e., there’s no OSE. 

The blocking period estimation, step (i), starts from the last system object, object[M ], where 
there are no dependencies on the delay of subsequent demand requests. For every s such that 
( s,object[M] ) is an edge of Sf, we calculate the delay, s.D[M ], and the pairwise inter-demand 
period, s.T[M ], of demand requests to object[M] through the updateStates(#B , k, thread , object , 
{ T v } V £aiiStates 7 R) function (line 9), where allStates = {thread[n) \ n £ [l,iV]} U {object[k\ | 
k £ [1,M — 1]}. Therefore, the procedure can compute the blocking period of demand requests to 
object[M — 1], s.B[M — 1], because it has just estimated the dependencies of subsequent demand 
requests for object[M}. Repeating this process for k = M — 1,... ,1, the procedure compute the 
blocking periods of demand requests to any object[k ], s._B[fc], where k £ [1 ,M]. Note that this 
is possible, since in every step k, k = M — 1,... ,1, of this for loop, we have already computed 
the demand request delays and the inter-demand periods of every object[k r ], object[k\.D[k'], and 
respectively, object[k].T[k'], where k' £ [fc + 1 ,M\ (Section 6). 

After computing the blocking periods, delays and pairwise inter-demand periods, the proce¬ 
dure estimates the threads’ inter-demand periods (Section 7), which can be used to estimate the 
job completion rates. It does this in step (ii), through the function augnmtThreadBlock(T,iXi n , 
blocking(n)) (line 10) and by using the thread idle probabilities (Appendix 4.1), where blocking{n) 
= A + ’E^L 1 R{thread[n], object[k ]) • thread[n].D[k\. Moreover, note that the augmntThreadBlockQ 
function (Section 7) allows the repeat-until loop to stop whenever it detects that the inter-arrival 
time of jobs to a thread becomes less or equal than the time it takes that thread to complete 
such jobs (i.e., there’s no OSE). Note that the repeat-until loop breaks when the Boolean variable 
loopEnd is true (line 10). 

Step (iii) uses the updateStates() function (line 11) for calculating the object inter-demand 
periods, after calculating the new estimates for the delay and the pairwise inter-demand periods. 
The procedure estimates the inter-demand period T 0 bject[k], for each object[k\, k £ [1 ,M — 1], via 
the item inter-demand period of demand requests for items that precede object[k] in a job path (•, 
item[j], object[k ], •) (Section 7). Therefore, the procedure estimates the inter-demand periods for 
each object[k\ in the order of k = 1,..., M — 1. 

Once the procedure verifies the satisfaction of the e-OSE conditions (Section 2), the repeat-until 
loop end (line 12) and the procedure returns. 
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9.2 The initializeSystemState () function 

The procedure initializeSystemStateQ initializes Algorithm 3 assuming that there is no contention, 
i.e., all queues have zero length in the shared-object system (lines 6 and 14). In the context 
of shared-object systems, no contention means that the delay of each demand request equals its 
blocking period, s.D[k] = s.B[k], for all items s and d = object[k\. Thus, the procedure initializes 
the blocking periods s.B[k] through the function initRecordQ (lines 15-16). It uses the average 
demand completion period f s ,object[k] , as proposed in Section 6, which is a constant. That is, 
s.B[M] = A + R(s,d ) • R(d,d) ■ f s> d, if d = object[M} and s.B[k] = A + R(s,d) ■ R(d,d) ■ 
/ S) d+E^ fc+1 i?(d, object[£])-d.D[£], if d = object[k\, k £ [1, M— 1], where s £ {thread[n\ \n£ [1, iV]} 
U {object[i] | i £ [1, k — 1]}. Namely, the blocking period of a request to access (demand) the last 
object equals to the acquisition period plus the average demand completion period on that object 
and the blocking period of a demand to d = object[k \, k £ [1 ,M — 1], is recursively computed as 
the sum of the acquisition period, A, the average demand completion period R(s,d) ■ R{d,d) ■ f St d 
plus the average blocking period of a subsequent demand to one of the following objects d.B[£], 
t £ [k+ 1, M], weighted by the probability of sending such a demand, R(d, object[£]), £ £ [k+ 1, M], 
Moreover, the s.T[/c] pairwise inter-demand periods are set to equal s.B[k]. 

The Tthread[n] (thread) inter-demand periods are computed (line 17) through the function 
augmentThreadBlock{Yii\i^ n , blockingQn)), where blocking(n ) is defined in line 4. Furthermore, 
the T 0 bject[j] (object) inter-demand periods are set to E eL j+1 R{object\j], object{£ ]) • object[j].T[£ ] 
(line 18). Algorithm 3 iteratively finds the correct values of the pairwise and item inter-demand 
periods (arbitrary initialization of the system’s state is proposed in [18]). 

9.3 The updateStates () function 

This function updates the blocking periods and the item inter-demand periods with respect to 
object[k] (lines 19-25). If the input tag is the function updates the blocking periods s.B[k] for 
every s € {thread[n} \ n £ [1, AT]} U {object[j] : j £ [1 ,k — 1]}, as in Section 6. Otherwise, if the 
input tag is #T, the function updates the inter-demand period of d = object[k\, Td, as in Section 7. 

The procedure updateStatesQ defines the contention subsystem for every set of item sources 
Sk using the function defContentionSubsystem( ) (line 20 if Sk = thread and line 23 if Sk — 
{object[j}}, j £ [1 ,k — 1]). It then calculates the delay, s.D[fc], and pairwise inter-demand period, 
s.T[k], for every s £ Sk using the BDFQ function of Appendix 4.2 (line 21 if Sk = thread and 
line 24 if Sk = {object[j}}, j £ [1 ,k — 1]). The procedure ends with the computation of the blocking 
periods or the inter-demand periods, depending on the tag with which the procedure was called. 

9.4 The recalcBQ and recalcTi) functions 

We present the exact formulas that give the first three moments of s.B[k], i.e., the function 
recalcBQ, and Td, i.e., the function recalcTQ, where d = object[k\. We find these formulas using 
the equations in sections 6, and respectively, 7. 

Let F St d = A + R(s, d) ■ R(d, d) ■ f Si d, be the sum of the acquisition time and the job completion 
period times the related probabilities (F St d can be bound by a constant and thus can be treated as 
such). The first three moments of the blocking time are E(s.B[M] m ) = ( F St M) m , for m = 1,2,3, 
and E(s.B[k ]) = F S)d + E^f =fc+1 (Pr[(s, •, d, object[k'])\ ■ E(d.B[k'}), E(s.B[k} 2 ) = {F S}0bject[d] ) 2 
+ E£f =fc+1 Pr[(s, ., d, object[k'])\ (E{d.D[k '] 2 ) + 2 F s4 • E{d.D[k’})) and E{s.B[k ] 3 ) = (F s , d ) 3 + 
E£f =fc+1 Pr[(s, ., d, object[k'])] ( E(d.D[k f) + 3 (F s , d ) 2 • E{d.D[k']) + 3 F s , d ■ E(d.D{k'} 2 )), if d = 
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object[k\ and k E [1, M — 1]. Moreover, the first three moments of 7d, where d = object[k ], are E(Td) 

— s£arrivals(k ) d') • {F s ^d -^(Ts)) / s£arrivals(k ) ^(^5 ^0)? ) = (^arrivals(k) ^(^5 

d) • ((f m ) 2 + £(7; 2 ) + 2f s , d • £ 7 ( 7 ;)) / (s se arrivals(k) ^0) and EQT'f ) s£arrivals(k ) k^(A 

d) ■ {(F s , d ) 3 + E(T)l) + 3(F Sjd ) 2 E(Ts) + 3F s , d E(T?)) / (S sear . H „ a(s ( fc ) w(s, d)). We remind that 
arrit>aZs(object[fc]) = thread U {object[j] | j € [1, fc — 1]} and w(s, object[fc]) = i?(s, o6ject[fc])-s.T[/c] 
(Section 7). 

9.5 Running time 

Notice that the running times of Algorithm 2 (Baynat-Dallery framework) and Algorithm 3 depend 
on the number of iterations of these algorithms. Lemma 11 bounds the procedure running time for 
one iteration. 

Lemma 11. The running time of one iteration of Algorithm 3 is in 0(M 2 • N 4 + M 3 ). 

Proof. We look at the running time of each step of the repeat-until loop to find the algorithm’s 
running time. Steps (i) and (iii) call the function updateStatesQ so at most M; M, and respectively 
M — 1 times. Note that the function updateStatesQ calls at most 1 + (M — 1) times the function 
BDF (), because the input parameter k, which denotes the object whose state is to be updated, 
is at most M (line 22). The first call is done by setting Sj. = thread and (at most) M — 1 calls 
are done by setting Sk = {object[j]}, j £ [1 ,k — 1]. The BDFQ function has running time in 
0(M ■ N 4 - If) and 0(M -If) (Lemma 9), when BDFQ is called for Sk = thread , and respectively, 
for Sk = {object[j]}, j £ [1, k — 1], where If denotes the maximum number of framework iterations 
(Lemma 9). It also holds that the function augmntThreadBlockQ is called N times in step (ii) 
and its the running time is practically constant (see Appendix 4.1 with respect to the findings 
of Latouche and Ramaswami [13]). Thus, the running time of one iteration of Algorithm 3 is in 
0{M ■ (M • N 4 ■ I f ) + N + M ■ ((M - 1) • M ■ //)) = 0(M 2 ■ N 4 + M 3 ). 

10 Conclusions 

We consider a resource allocation problems that can be modeled as generalized dynamic dining 
philosophers problems. We formulate questions that are associated with equilibrium situations in 
such systems, where input and output rates match. We believe that the way we find the equilibrium 
as well as estimate the delay and throughput in such systems can be the basis for an analysis 
of further generalizations of the problem studied here, such as the ones that are described in the 
literature on resource allocation, e.g., non-sequential scheduling, such as parallel resource acquisition 
(2-phase locking) and resource acquisition that is reactive to contention conditions [12, 15, 10, 11, 
17], 
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Algorithm 3: The procedure for finding an £-OSE 

1 Input: Number of objects, M and threads, A; jobs {job,) ie \i n and their arrival rates to 
threads {Aj )n } ne hjvi,ie[i,ji; R request probability matrix; 

2 Variables: The item states are recorded in the arrays thread[ 1 , A] and object[ 1 , M — 1 ], 
such that thread[n] is of the form ((inter-demand period) T[1,M], (delay) D[1,M\, 

(blocking period) B[1,M]} and object[k\ is of the form ((inter-demand period) T[k + 1 ,M], 
(delay) D[k + 1, M], (blocking period) B[k + 1, M]); T v , item tbs inter-demand period, for 
every v = thread[n\ such that n € [1, A] and v = object[k] such that k £ [1, M — 1]; loopEnd 
(Boolean) this variable is true when the function augmentThreadBlock {) decides that no 
OSE can be found and thus the loop should stop. 

3 Output: (thread{l, A], object[l, M — 1 ]) 

4 Macros: objThrdSetj) = {{tag, i, r)\{tag = #o A T object {{\ = r) V (tag = #t A T thread [i\ = t)} 
converged(prev, curr) = {tag, i, r) £ prev, {tag, i, t') £ curr : |r — r'| > e) 

blocking{n) = A + T>%L 1 R(thread[n\, object[k]) ■ thread[n\.D[k] 
all States = {thread[n]\n £ [1, A]} U {object[k]\k £ [1 ,M — 1]} 

5 begin 

6 initializeSystemState(N, M, {.)o6 J } ig r 1 n, {A, >n }) 

7 loopEnd ■£- false 

8 repeat 

9 let prevSet t— objThrdSet {) for k = M to 1 do 
updateStates{#B, k, thread, object, {T v }veaiistates, R)', 

10 foreach n £ [ 1 , A] do 

Othread[n]j loopEnd ) t— augmntThreadBlock{T,iXi t n, blocking{n))\ 

11 for k = 1 to M — 1 do updateStates(#T, k, thread, object, {T v } v eaiistates, R)- 

12 until converged{prevSet, objThrdSet {)) V loopEnd = true; 

13 return {thread[ 1 , A], object[l, M — 1 ]) 

14 procedure initializeSystemState(N, M, {.)o&i}?:e[i,J]; {A; >n }) begin 

15 for k = M to 1 do 

16 foreach item s such that ( s,object\k ]) is an edge in td? do 
s •(— initRecord{s, k, thread[ 1, A], object{\, M — 1]); 

17 for n = 1 to A do Tthread[n\ augmentThreadBlock(Tii\i >n , blocking{n))\ 

is for k = 1 to M — 1 do T 0 bject[k\ <— E,fL k+1 R{object[k\,object[£}) ■ object[k].T[l\, 

19 procedure updateStates{tag , k, thrSet, objSet, ( T v ) V £aiiStates, R) begin 

20 CS(thrSet,k) £- 

de f ContentionSubsy stem{thread[l, A], object[ 1, M — 1], k, (7 ~ v ) v Gail States, R)', 

21 {thrSet[n].T[k\,thrSet[n\.D[k]) ne [i y Nj <— BDF{CS{thrSet,k)); 

22 for j = 1 to k — 1 do 

23 CS({objSet\j]}, k) <— 

defContentionSubsystem{{objSet\j]}, object[ 1, M - 1 ],k, {T v ) ve aiistates, R) 

24 (objSet\j].T[k\,objSet\j][n].D[k\) £- BDF{CS{thrSet,k)) 

if tag = then recalcB{k), else if tag = then recalcT{k); 
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